Toward Solution of Matrix Equation X = Af(X)B + C

Bin Zhou* ' James Lam* Guang-Ren Duan* 



Abstract 



This paper studies the solvability, existence of a unique solution, closed-form solutions and numerical
solutions of the matrix equation $X = Af(X)B + C$ with $f(X) = X^T$, $f(X) = \overline{X}$ and $f(X) = X^H$, where $X$
is the unknown. It is proven that the solvability of these equations is equivalent to the solvability of some
auxiliary standard Stein equations of the form $W = \mathcal{A}W\mathcal{B} + \mathcal{C}$, where the dimensions of the coefficient
matrices $\mathcal{A}$, $\mathcal{B}$ and $\mathcal{C}$ are the same as those of the original equation. Closed-form solutions of the equation
$X = Af(X)B + C$ can then be obtained by utilizing standard results on the standard Stein equation.
On the other hand, some generalized Stein iterations and accelerated Stein iterations are proposed to
obtain numerical solutions of the equation $X = Af(X)B + C$. Necessary and sufficient conditions
are established to guarantee the convergence of the iterations.

1 Introduction



Matrix equations play a very important role in linear system theory. For example, the continuous-time
Lyapunov matrix equation $AX + XA^T = -Q$ and the discrete-time Lyapunov matrix equation $AXA^T - X = -Q$
(which is also known as the Stein equation) have very important applications in the stability analysis of
continuous-time and discrete-time linear systems, respectively ([IS]). A more general class of equations in
the form of AX + XB = C, which is called the Sylvester matrix equation, can be used to solve many control 
problems such as pole assignment, eigenstructure assignment, robust pole assignment, and observer design 
(see, for example, [5] and [25]). Other matrix equations such as the coupled Sylvester matrix equations and 
the Riccati equations have also found numerous applications in control theory. For more related introduction, 
see [8], [15], [17], [25] and the references therein. 



Because of the wide applications of linear equations in control theory, many approaches have been established 
in the literature to provide both closed-form and numerical solutions. Take the Sylvester matrix equation 
$AX + XB = C$ for example. The controllability and observability matrices are used in [2] to construct
the closed-form solutions, and such a solution is also obtained by the inversion of a square matrix in [12].
Regarding the numerical methods for solving the Sylvester matrix equation, one of the most efficient
approaches is the Hessenberg-Schur form based algorithm proposed in [5]. Besides, the iterative algorithm
using the hierarchical identification principle proposed in [3] and [5] is also shown to be effective in solving
this class of equations. A more general form of this class of equations is recently considered in [37]. For more
references on this topic, see [3], [7], [11], [30], [TBJ, [21], [15], [33], [23] and the references given there.

As a generalization of the continuous-time Lyapunov matrix equation, the following linear matrix equation 

$$AX + X^T B = C, \tag{1}$$

has received much attention in the literature over the past several decades. For example, special cases of
(1) in the form of $AX \pm X^T A^T = B$ were thoroughly studied in [28], [6] and [10], where necessary and
sufficient conditions for the existence of solutions were obtained. Very recently, the solvability and
closed-form solutions of equation (1) were studied in [15] by using the Moore-Penrose generalized inverse. As a more
general case, the linear equation

$$AX^T B + EXF = -C, \tag{2}$$

is investigated in [17] and [22], where iterative approaches are developed to obtain the numerical solutions.
In this paper, we study the matrix equation

$$X = Af(X)B + C, \tag{3}$$

with $f(X) = X^T$, $f(X) = \overline{X}$ and $f(X) = X^H$, and $A$, $B$, $C$ being known matrices. If $f(X) = X^T$, namely,
$X = AX^T B + C$, then equation (3) can be regarded as a special case of (2). If $f(X) = \overline{X}$, equation (3)
becomes $X = A\overline{X}B + C$, which was first studied in [13]. Several problems will be studied for equation (3).
These problems include solvability, existence of a unique solution, closed-form solutions and numerical solutions.
The key idea is to transform this class of equations into the standard Stein equation (see, for example, [25])

$$W = \mathcal{A}W\mathcal{B} + \mathcal{C}, \tag{4}$$

where $\mathcal{A}$, $\mathcal{B}$ and $\mathcal{C}$ are functions of $A$, $B$, $C$, and $W$ is unknown. It is shown that the solvability and the
existence of a unique solution of the original equation (3) are equivalent to the solvability and the existence of
a unique solution of equation (4). Moreover, we have proven that if the general solution (denoted by $W$) of
equation (4) has been obtained, then the general solution (denoted by $X$) of the original equation (3) can
be obtained via the formulation

$$X = \frac{1}{2}\left(W + Af(W)B + C\right).$$

Regarding numerical solutions, we extend our early results on Smith iteration for the standard Stein equation
given in [24] to equation (3), and necessary and sufficient conditions are obtained to guarantee the convergence
of the iterates. Different from the real representation based approach for solving equation $X = A\overline{X}B + C$
developed in [13], where the matrix equation is transformed into a standard Stein equation whose coefficient
matrices have dimensions twice those of the original coefficient matrices, here the dimensions of the coefficient
matrices of the auxiliary matrix equation (4) are the same as the dimensions of the coefficient matrices of the
original equation (3).

The rest of this paper is organized as follows. Some preliminary results in linear algebra that will be used
in this paper are recalled in Section 2. A basic result regarding the solution of linear equations is developed in
Section 3, which plays an important role in the development of this paper. Then, for $f(X) = X^T$, $f(X) = \overline{X}$
and $f(X) = X^H$, the matrix equation in (3) is, respectively, considered in Section 4, Section 5 and Section
6. A more general case is treated in Section 7. The paper is finally concluded in Section 8.

2 Notations and Preliminaries 

2.1 Notations and Some Standard Results 

In this paper, for a matrix $A$, we use $A^T$, $A^H$, $\overline{A}$, $\lambda\{A\}$, $\rho(A)$, $\mathrm{rank}(A)$ and $\det(A)$ to denote respectively
the transpose, the conjugate transpose, the conjugate, the spectrum, the spectral radius, the rank and the
determinant of $A$. The symbol $\otimes$ denotes the Kronecker product. For an $m \times n$ matrix $R = [r_{ij}]$, the so-called
column stretching function $\mathrm{vec}(\cdot)$ is defined as

$$\mathrm{vec}(R) = \begin{bmatrix} r_{11} & \cdots & r_{m1} & \cdots & r_{1n} & \cdots & r_{mn} \end{bmatrix}^T.$$

For three matrices $A$, $X$ and $B$ with appropriate dimensions, we have the following well-known result related
to the column stretching function:

$$\mathrm{vec}(AXB) = (B^T \otimes A)\,\mathrm{vec}(X). \tag{5}$$
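Identity (5) is used throughout the paper, so a quick numerical check may be helpful. The sketch below assumes NumPy and a column-major `vec` helper; the matrix sizes and the random seed are arbitrary illustrative choices, not taken from the paper.

```python
import numpy as np

def vec(M):
    """Column stretching: stack the columns of M into one long vector."""
    return M.reshape(-1, order="F")

rng = np.random.default_rng(0)
A = rng.standard_normal((3, 4))
X = rng.standard_normal((4, 2))
B = rng.standard_normal((2, 5))

lhs = vec(A @ X @ B)               # vec(AXB)
rhs = np.kron(B.T, A) @ vec(X)     # (B^T kron A) vec(X)
vec_identity_holds = np.allclose(lhs, rhs)
```

Note that `order="F"` matters here: NumPy's default row-major flattening would correspond to stacking rows, not columns.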
We then can introduce the following standard result.

Lemma 1 ([1] and [17]) Let $X \in \mathbb{C}^{m \times n}$ be any matrix. Then

$$\mathrm{vec}(X^T) = P_{(m,n)}\,\mathrm{vec}(X),$$

where $P_{(m,n)}$ is uniquely determined by the integers $m$ and $n$. Moreover, the matrix $P_{(m,n)}$ has the following
properties.

1. For two arbitrary integers $m$ and $n$, $P_{(m,n)}$ has the following explicit form

$$P_{(m,n)} = \begin{bmatrix} E_{11}^T & E_{12}^T & \cdots & E_{1n}^T \\ E_{21}^T & E_{22}^T & \cdots & E_{2n}^T \\ \vdots & \vdots & \ddots & \vdots \\ E_{m1}^T & E_{m2}^T & \cdots & E_{mn}^T \end{bmatrix} \in \mathbb{R}^{mn \times mn},$$

where $E_{ij}$, $i = 1, 2, \ldots, m$, $j = 1, 2, \ldots, n$, is an $m \times n$ matrix with the element at position $(i, j)$ being 1
and the others being 0.

2. For two arbitrary integers $m$ and $n$, $P_{(m,n)}$ is a unitary matrix, that is,

$$P_{(m,n)} P_{(m,n)}^T = P_{(m,n)}^T P_{(m,n)} = I_{mn}.$$

3. For two arbitrary integers $m$ and $n$, there holds $P_{(m,n)} = P_{(n,m)}^T = P_{(n,m)}^{-1}$.

4. Let $m$, $n$, $p$ and $q$ be four integers and $A \in \mathbb{C}^{m \times n}$, $B \in \mathbb{C}^{p \times q}$. Then

$$P_{(m,p)} (B \otimes A) = (A \otimes B) P_{(n,q)}. \tag{6}$$

Let $A \in \mathbb{C}^{m \times n}$ be any given matrix. The generalized inverse of $A$, denoted by $A^-$, is such that

$$A A^- A = A.$$

Then we can recall the following standard result regarding solutions of linear equations.

Lemma 2 Let $A \in \mathbb{C}^{m \times n}$ and $b \in \mathbb{C}^m$ be given. Then the linear equation

$$Ax = b \tag{7}$$

is solvable if and only if there exists an $A^-$ such that $AA^- b = b$. Moreover, if it is solvable and $x^{\dagger}$ denotes
one of its solutions, then the general solution of equation (7) is given by

$$x = x^{\dagger} + (I_n - A^- A)\, t,$$

where $t \in \mathbb{C}^n$ is an arbitrary vector.

The following result is the well-known Schur complement.

Lemma 3 ([1]) Let $X_1$, $X_2$, $X_{12}$ and $X_{21}$ be some matrices with appropriate dimensions. Assume that $X_1$
and $X_2$ are nonsingular. Then,

$$\det \begin{bmatrix} X_1 & X_{12} \\ X_{21} & X_2 \end{bmatrix} = \det(X_1) \det\left(X_2 - X_{21} X_1^{-1} X_{12}\right) = \det(X_2) \det\left(X_1 - X_{12} X_2^{-1} X_{21}\right).$$
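The permutation matrix $P_{(m,n)}$ of Lemma 1 can be built directly from index bookkeeping. The following is a minimal sketch, assuming NumPy and column-major `vec`; the helper name `commutation_matrix` is an illustrative choice and does not come from the paper. It checks the defining relation together with Items 2, 3 and 4 on random data.

```python
import numpy as np

def vec(M):
    """Column stretching: stack the columns of M."""
    return M.reshape(-1, order="F")

def commutation_matrix(m, n):
    """Return P(m, n) such that vec(X.T) = P(m, n) @ vec(X) for any m-by-n X."""
    P = np.zeros((m * n, m * n))
    for i in range(m):
        for j in range(n):
            # X[i, j] sits at index j*m + i in vec(X) and at i*n + j in vec(X.T)
            P[i * n + j, j * m + i] = 1.0
    return P

rng = np.random.default_rng(0)
m, n, p, q = 2, 3, 4, 2
X = rng.standard_normal((m, n))
A = rng.standard_normal((m, n))
B = rng.standard_normal((p, q))

P_mn = commutation_matrix(m, n)
defining_ok = np.allclose(P_mn @ vec(X), vec(X.T))
item2_ok = np.allclose(P_mn @ P_mn.T, np.eye(m * n))
item3_ok = np.allclose(P_mn, commutation_matrix(n, m).T)
item4_ok = np.allclose(commutation_matrix(m, p) @ np.kron(B, A),
                       np.kron(A, B) @ commutation_matrix(n, q))
```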

Finally, we recall some facts about linear iteration. 

Definition 1 ([24]) For a linear iteration

$$x_{k+1} = M x_k + g, \quad g \in \mathbb{R}^n, \tag{8}$$

where $M$ and $g$ are respectively a constant matrix and a constant vector and $\rho(M) < 1$, the asymptotic exponential
convergence rate is $-\ln(\rho(M))$.



Basically, the smaller the number $\rho(M)$, the faster the convergence of the iteration (8). See [53] for explanations.

2.2 Smith Iteration and Closed-Form Solutions of Standard Stein Equation

Consider the following standard Stein matrix equation

$$X = AXB + C, \tag{9}$$

where $A \in \mathbb{C}^{m \times m}$, $B \in \mathbb{C}^{n \times n}$ and $C \in \mathbb{C}^{m \times n}$ are known and $X \in \mathbb{C}^{m \times n}$ is to be determined. The associated
Smith iteration is defined as

$$X_{k+1} = A X_k B + C, \quad \forall X_0 \in \mathbb{C}^{m \times n}. \tag{10}$$

It is known that the above Smith iteration converges to the unique solution of equation (9) for an arbitrary
initial condition if and only if $\rho(A)\rho(B) < 1$. Moreover, the asymptotic exponential convergence rate is
$-\ln(\rho(A)\rho(B))$. Therefore, if $\rho(A)\rho(B)$ is very close to 1, the convergence is very slow. To improve the
convergence rate, the so-called Smith($l$) iteration can be applied. The idea of Smith($l$) iteration is to
combine $l$ steps of Smith iterations into one step. Notice that

$$\begin{aligned} X_{k+l} &= A X_{k+l-1} B + C \\ &= A\left(A X_{k+l-2} B + C\right) B + C \\ &= A^2 X_{k+l-2} B^2 + ACB + C \\ &= \cdots \\ &= A^l X_k B^l + \sum_{i=0}^{l-1} A^i C B^i. \end{aligned} \tag{11}$$

By denoting $X_{kl} = Y_k$, $k = 0, 1, \ldots$, we can rewrite (11) as

$$Y_{k+1} = A^l Y_k B^l + \sum_{i=0}^{l-1} A^i C B^i, \quad \forall Y_0 = X_0 \in \mathbb{C}^{m \times n}. \tag{12}$$

It is shown in [24] that the above iteration converges to the unique solution of (9) for an arbitrary
initial condition if and only if $\rho(A)\rho(B) < 1$. Moreover, the asymptotic exponential convergence rate is
$-l \ln(\rho(A)\rho(B))$. Therefore, the convergence rate of (12) increases as $l$ increases.
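To make the effect of $l$ concrete, the sketch below runs the Smith($l$) iteration (12) for $l = 1$ (the plain Smith iteration (10)) and $l = 4$ on the same data. It assumes NumPy; the zero initial condition, the relative stopping test and the random test matrices (rescaled so that $\rho(A) = \rho(B) = 0.9$) are illustrative choices rather than part of the paper.

```python
import numpy as np

def smith_l(A, B, C, l=1, tol=1e-12, max_iter=10_000):
    """Smith(l) iteration Y_{k+1} = A^l Y_k B^l + sum_{i<l} A^i C B^i with Y_0 = 0.

    Returns the final iterate and the number of steps taken.
    """
    Al = np.linalg.matrix_power(A, l)
    Bl = np.linalg.matrix_power(B, l)
    # Constant term S = sum_{i=0}^{l-1} A^i C B^i
    S = np.zeros_like(C)
    term = C.copy()
    for _ in range(l):
        S = S + term
        term = A @ term @ B
    Y = np.zeros_like(C)
    for k in range(1, max_iter + 1):
        Y_next = Al @ Y @ Bl + S
        if np.linalg.norm(Y_next - Y) <= tol * max(1.0, np.linalg.norm(Y_next)):
            return Y_next, k
        Y = Y_next
    return Y, max_iter

rng = np.random.default_rng(1)
A = rng.standard_normal((4, 4))
A *= 0.9 / max(abs(np.linalg.eigvals(A)))   # rho(A) = 0.9
B = rng.standard_normal((3, 3))
B *= 0.9 / max(abs(np.linalg.eigvals(B)))   # rho(B) = 0.9
C = rng.standard_normal((4, 3))

X1, steps1 = smith_l(A, B, C, l=1)
X4, steps4 = smith_l(A, B, C, l=4)
```

Each Smith($l$) step costs the same two matrix products as a plain Smith step once $A^l$, $B^l$ and the constant term are precomputed, which is why fewer steps translate into less work.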

Though the Smith($l$) iteration (12) can improve the convergence rate of the standard Smith iteration (10), it
still converges exponentially. In order to obtain a super-exponential convergence rate, and by noting that the
closed-form solution of equation (9) is

$$X^* = \sum_{i=0}^{\infty} A^i C B^i, \quad \rho(A)\rho(B) < 1,$$

the so-called $r$-Smith iteration can be constructed.

Lemma 4 ([24]) Let $r \geq 2$ be a prescribed positive integer. Assume that $\rho(A)\rho(B) < 1$. Construct the
following iteration:

$$X_{k+1} = \sum_{i=0}^{r-1} A_k^i X_k B_k^i, \quad A_{k+1} = A_k^r, \quad B_{k+1} = B_k^r, \quad k \geq 0, \tag{13}$$

with initial condition $X_0 = C$, $A_0 = A$, and $B_0 = B$. Denote $E_k = X^* - X_k$, where $X^*$ is the unique solution
of (9). Then $\lim_{k \to \infty} X_k = X^*$ and

$$X_k = \sum_{i=0}^{r^k - 1} A^i C B^i, \quad E_k = A^{r^k} X^* B^{r^k}, \quad k \geq 0. \tag{14}$$

If $r = 2$, the $r$-Smith iteration reduces to the well-known Smith accelerative iteration ([20]). It is clear
that the $r$-Smith iteration (13) has a super-exponential convergence rate and is very effective in obtaining
numerical solutions of equation (9) ([24]).
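The $r$-Smith recursion (13) is short to implement. The sketch below assumes NumPy and a fixed number of steps (an illustrative simplification; a practical code would monitor a residual), and compares the result with a direct solve of (9) through the Kronecker form $(I - B^T \otimes A)\,\mathrm{vec}(X) = \mathrm{vec}(C)$.

```python
import numpy as np

def r_smith(A, B, C, r=2, num_steps=8):
    """r-Smith iteration (13): after k steps X_k holds the first r**k series terms."""
    X, Ak, Bk = C.copy(), A.copy(), B.copy()
    for _ in range(num_steps):
        term, X_next = X, X.copy()          # i = 0 term of the sum
        for _ in range(1, r):
            term = Ak @ term @ Bk            # A_k^i X_k B_k^i
            X_next = X_next + term
        X = X_next
        Ak = np.linalg.matrix_power(Ak, r)   # A_{k+1} = A_k^r
        Bk = np.linalg.matrix_power(Bk, r)   # B_{k+1} = B_k^r
    return X

rng = np.random.default_rng(2)
A = rng.standard_normal((4, 4))
A *= 0.9 / max(abs(np.linalg.eigvals(A)))
B = rng.standard_normal((3, 3))
B *= 0.9 / max(abs(np.linalg.eigvals(B)))
C = rng.standard_normal((4, 3))

X_rsmith = r_smith(A, B, C, r=2, num_steps=8)   # 2**8 = 256 series terms

# Reference solution from the vectorized linear system
K = np.eye(12) - np.kron(B.T, A)
X_direct = np.linalg.solve(K, C.reshape(-1, order="F")).reshape((4, 3), order="F")
```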
In the following, we consider closed-form solutions of the Stein equation (9). Let the characteristic polynomial
of $A$ be

$$h_A(s) = \det(sI_m - A) = s^m + \sum_{i=0}^{m-1} \alpha_i s^i.$$

Then it is easy to verify that

$$\tilde{h}_A(s) = \det(I_m - sA) = \alpha_m + \sum_{i=1}^{m} \alpha_{m-i} s^i, \quad \alpha_m = 1.$$

We then can cite the following result regarding closed-form solutions of equation (9).

Lemma 5 (JWf) Let $A \in \mathbb{C}^{m \times m}$, $B \in \mathbb{C}^{n \times n}$ and $C \in \mathbb{C}^{m \times n}$ be given. Then

1. If equation (9) has a solution $X$, then

$$X \tilde{h}_A(B) = \sum_{k=1}^{m} \sum_{s=1}^{k} \alpha_k A^{k-s} C B^{m-s}.$$

2. If equation (9) has a unique solution $X^*$, then

$$X^* = \left( \sum_{k=1}^{m} \sum_{s=1}^{k} \alpha_k A^{k-s} C B^{m-s} \right) \left( \tilde{h}_A(B) \right)^{-1}.$$
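The closed-form expression of Lemma 5 can be evaluated directly for small problems. The sketch below is an illustration under stated assumptions: it uses NumPy, obtains the coefficients $\alpha_k$ from `np.poly`, and forms an explicit inverse, which is acceptable for a small demonstration but not recommended for ill-conditioned or large problems. The function name `stein_closed_form` is hypothetical.

```python
import numpy as np

def stein_closed_form(A, B, C):
    """Evaluate X* = (sum_k sum_s alpha_k A^(k-s) C B^(m-s)) * inv(h~_A(B))."""
    m = A.shape[0]
    # np.poly gives the coefficients of det(sI - A), highest power first;
    # reverse them so that alpha[k] multiplies s**k (alpha[m] == 1).
    alpha = np.poly(A)[::-1]
    powA = [np.linalg.matrix_power(A, k) for k in range(m)]
    powB = [np.linalg.matrix_power(B, k) for k in range(m + 1)]
    # h~_A(B) = sum_{k=0}^{m} alpha_k B^(m-k), i.e. det(I - sA) evaluated at s = B
    hB = sum(alpha[k] * powB[m - k] for k in range(m + 1))
    N = np.zeros(C.shape, dtype=np.result_type(A, B, C, alpha))
    for k in range(1, m + 1):
        for s in range(1, k + 1):
            N = N + alpha[k] * powA[k - s] @ C @ powB[m - s]
    return N @ np.linalg.inv(hB)

rng = np.random.default_rng(3)
A = 0.5 * rng.standard_normal((3, 3))
B = 0.5 * rng.standard_normal((2, 2))
C = rng.standard_normal((3, 2))
X_closed = stein_closed_form(A, B, C)
```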



3 A Basic Result on Linear Equation 

Before giving a detailed discussion of equation (3), we present a basic result on linear equations. This result is
essential in the development of this paper.

Proposition 1 Let $A \in \mathbb{C}^{n \times n}$ and $c \in \mathbb{C}^n$ be given. Then the linear equation

$$(I_n - A^2)\,\omega = (I_n + A)\, c, \tag{15}$$

is solvable with unknown $\omega$ if and only if the linear equation

$$(I_n - A)\, x = c, \tag{16}$$

is solvable with unknown $x$. More specifically,

1. If $x$ is a solution of equation (16), then $\omega = x$ is a solution of equation (15).

2. If $\omega$ is the general solution of (15), then the general solution of equation (16) is

$$x = \frac{1}{2}(I_n + A)\,\omega + \frac{1}{2} c. \tag{17}$$

3. If equation (15) has a unique solution $\omega^*$, then equation (16) also has a unique solution $x^* = \omega^*$.
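Item 2 can be illustrated numerically on a case where (15) has more solutions than (16). The sketch below assumes NumPy; the diagonal matrix `A` (with an eigenvalue at $-1$) and the vector `c` are made-up illustrative data. A solution of (15) is perturbed along the null space of $I_n - A^2$ and then mapped through formula (17).

```python
import numpy as np

A = np.diag([-1.0, 0.5, 2.0])   # the eigenvalue -1 makes (15) non-uniquely solvable
c = np.array([1.0, 2.0, 3.0])
I = np.eye(3)

# One particular solution of (15) by least squares, plus a null-space direction
w_part, *_ = np.linalg.lstsq(I - A @ A, (I + A) @ c, rcond=None)
w = w_part + 7.0 * np.array([1.0, 0.0, 0.0])    # still a solution of (15)

x = 0.5 * (I + A) @ w + 0.5 * c                  # formula (17)

residual_15 = np.linalg.norm((I - A @ A) @ w - (I + A) @ c)
residual_16 = np.linalg.norm((I - A) @ x - c)
```

Here `w` itself does not solve (16), while its image `x` under (17) does, which is the point of Item 2.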

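Proof. Proof of Item 1. This is trivial since equation (15) can be written as $(I_n + A)\left((I_n - A)\,\omega - c\right) = 0$.
Proof of Item 2. Let $P$ be some matrix such that

$$A = P \begin{bmatrix} R & & \\ & J & \\ & & K \end{bmatrix} P^{-1}, \quad R \in \mathbb{C}^{n_0 \times n_0},\ J \in \mathbb{C}^{n_1 \times n_1},\ K \in \mathbb{C}^{n_2 \times n_2}, \tag{18}$$

where all the eigenvalues of $J$ are 1, all the eigenvalues of $K$ are $-1$ and $\{-1, 1\} \cap \lambda\{R\} = \emptyset$. Clearly,
$n_0 + n_1 + n_2 = n$. Then we can compute

$$I_n \pm A = P \begin{bmatrix} R_{\pm} & & \\ & J_{\pm} & \\ & & K_{\pm} \end{bmatrix} P^{-1}, \tag{19}$$

$$I_n - A^2 = P \begin{bmatrix} R_+ R_- & & \\ & J_- J_+ & \\ & & K_+ K_- \end{bmatrix} P^{-1}, \tag{20}$$

where $R_{\pm} = I_{n_0} \pm R$, $J_{\pm} = I_{n_1} \pm J$ and $K_{\pm} = I_{n_2} \pm K$. According to Lemma 2 and (20), the general
closed-form solution of (15) can be written as

$$\omega = \omega^{\dagger} + P \begin{bmatrix} 0 \\ J_+^{-1}\left(I_{n_1} - J_-^- J_-\right) s \\ K_-^{-1}\left(I_{n_2} - K_+^- K_+\right) t \end{bmatrix}, \quad \forall s \in \mathbb{C}^{n_1},\ \forall t \in \mathbb{C}^{n_2}, \tag{21}$$

where $J_-^-$ and $K_+^-$ are any fixed generalized inverses of $J_-$ and $K_+$, respectively, and $\omega^{\dagger}$ is a special solution
of equation (15), namely,

$$(I_n - A^2)\,\omega^{\dagger} = (I_n + A)\, c. \tag{22}$$

Let

$$x^{\dagger} = \frac{1}{2}(I_n + A)\,\omega^{\dagger} + \frac{1}{2} c.$$

Then, it follows from (22) that

$$\begin{aligned} (I_n - A)\, x^{\dagger} &= (I_n - A)\left(\frac{1}{2}(I_n + A)\,\omega^{\dagger} + \frac{1}{2} c\right) \\ &= \frac{1}{2}(I_n - A^2)\,\omega^{\dagger} + \frac{1}{2}(I_n - A)\, c \\ &= \frac{1}{2}(I_n + A)\, c + \frac{1}{2}(I_n - A)\, c \\ &= c, \end{aligned} \tag{23}$$

namely, $x^{\dagger}$ is a particular solution of equation (16). Hence, by using (19)-(21), we have

$$\begin{aligned} \frac{1}{2}(I_n + A)\,\omega + \frac{1}{2} c &= \frac{1}{2} P \begin{bmatrix} R_+ & & \\ & J_+ & \\ & & K_+ \end{bmatrix} P^{-1} P \begin{bmatrix} 0 \\ J_+^{-1}\left(I_{n_1} - J_-^- J_-\right) s \\ K_-^{-1}\left(I_{n_2} - K_+^- K_+\right) t \end{bmatrix} + \frac{1}{2}(I_n + A)\,\omega^{\dagger} + \frac{1}{2} c \\ &= \frac{1}{2} P \begin{bmatrix} 0 \\ \left(I_{n_1} - J_-^- J_-\right) s \\ K_+ K_-^{-1}\left(I_{n_2} - K_+^- K_+\right) t \end{bmatrix} + x^{\dagger} \\ &= \frac{1}{2} P \begin{bmatrix} 0 \\ \left(I_{n_1} - J_-^- J_-\right) s \\ K_-^{-1} K_+ \left(I_{n_2} - K_+^- K_+\right) t \end{bmatrix} + x^{\dagger} \\ &= \frac{1}{2} P \begin{bmatrix} 0 \\ \left(I_{n_1} - J_-^- J_-\right) s \\ K_-^{-1} \left(K_+ - K_+ K_+^- K_+\right) t \end{bmatrix} + x^{\dagger} \\ &= P \begin{bmatrix} 0 \\ \left(I_{n_1} - J_-^- J_-\right) s' \\ 0 \end{bmatrix} + x^{\dagger}, \end{aligned} \tag{24}$$

where we have used the fact that $K_+ = K_+ K_+^- K_+$ and denoted $s' = \frac{1}{2} s$. On the other hand, in view of (19)
and by using Lemma 2 again, the general solution of equation (16) can be expressed as

$$x = x^{\dagger} + P \begin{bmatrix} 0 \\ \left(I_{n_1} - J_-^- J_-\right) s' \\ 0 \end{bmatrix}, \quad \forall s' \in \mathbb{C}^{n_1}. \tag{25}$$

Comparing (24) with (25), we clearly see that $x$ in (17) is the general solution of equation (16).
Proof of Item 3. Equation (15) has a unique solution

$$\omega^* = (I_n - A^2)^{-1} (I_n + A)\, c = (I_n - A)^{-1} c,$$

if and only if $1 \notin \lambda\{A^2\}$, which implies that $1 \notin \lambda\{A\}$, namely, equation (16) has a unique solution
$x^* = (I_n - A)^{-1} c$. The proof is finished. ■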

We should point out that there is an important difference between Item 2 and Item 1 of Proposition 1: even
though $x$ is the general solution of equation (16), $\omega = x$ is not the general solution of equation (15). This
can be observed from (21) and (25). For illustration, we give an example.

Example 1 Let $A$ and $c$ in equations (15) and (16) be chosen as

$$A = \begin{bmatrix} 1 & 0 \\ 0 & -1 \end{bmatrix}, \quad c = \begin{bmatrix} 0 \\ 1 \end{bmatrix}.$$

Then it is easy to see that the general solutions of equations (16) and (15) are, respectively,

$$x = \begin{bmatrix} t \\ \frac{1}{2} \end{bmatrix}, \quad \omega = \begin{bmatrix} t \\ s \end{bmatrix}.$$

It follows that there are two free parameters in $\omega$ while there is only one free parameter in $x$, and thus $\omega = x$
is not the general solution of equation (15).

Also, if equation (16) has a unique solution, we cannot in general conclude that equation (15) has a unique
solution. A simple counterexample can be constructed by letting $-1 \in \lambda\{A\}$.
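The parameter count in Example 1 can be confirmed with a rank computation. The sketch below assumes NumPy and reads the example data as $A = \mathrm{diag}(1, -1)$ and $c = [0, 1]^T$, which is the reading consistent with the stated general solutions.

```python
import numpy as np

A = np.diag([1.0, -1.0])
c = np.array([0.0, 1.0])
I = np.eye(2)

# Number of free parameters = dimension of the null space of the coefficient matrix
free_params_16 = 2 - np.linalg.matrix_rank(I - A)        # equation (16)
free_params_15 = 2 - np.linalg.matrix_rank(I - A @ A)    # equation (15)

# x = [t, 1/2] solves (16) for any t; here t = 3 is an arbitrary choice
x = np.array([3.0, 0.5])
x_solves_16 = np.allclose((I - A) @ x, c)
```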

4 Equation $X = AX^T B + C$

In this section, we consider equation (3) with $f(X) = X^T$, namely, the following matrix equation

$$X = AX^T B + C, \tag{26}$$

where $A \in \mathbb{C}^{m \times n}$, $B \in \mathbb{C}^{m \times n}$ and $C \in \mathbb{C}^{m \times n}$ are some given matrices and $X \in \mathbb{C}^{m \times n}$ is a matrix to be
determined.

4.1 Solvability and Closed-Form Solutions

By taking $\mathrm{vec}(\cdot)$ on both sides of (26) and using Lemma 1, we have

$$\begin{aligned} \mathrm{vec}(C) &= \left(I_{mn} - (B^T \otimes A) P_{(m,n)}\right) \mathrm{vec}(X) \\ &= \left(P_{(n,m)} - B^T \otimes A\right) P_{(m,n)}\, \mathrm{vec}(X). \end{aligned} \tag{27}$$

Therefore, the following simple result can be obtained.

Lemma 6 Equation (26) is solvable if and only if

$$\mathrm{rank} \begin{bmatrix} P_{(n,m)} - B^T \otimes A & \mathrm{vec}(C) \end{bmatrix} = \mathrm{rank} \begin{bmatrix} P_{(n,m)} - B^T \otimes A \end{bmatrix}. \tag{28}$$

Moreover, it has a unique solution for arbitrary $C$ if and only if

$$1 \notin \lambda\left\{(B^T \otimes A) P_{(m,n)}\right\}. \tag{29}$$

Notice that to test condition (29), we need to compute the eigenvalues of a matrix having dimension $mn$,
which is demanding in practice. In the following, we will provide an alternative condition for the solvability
of this equation.
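For small $m$ and $n$, the vectorized form (27) gives a direct way to solve (26). The sketch below assumes NumPy and that condition (29) holds for the chosen data; the helper names are illustrative and not part of the paper. It also makes the cost concern visible: the linear system has dimension $mn$.

```python
import numpy as np

def vec(M):
    return M.reshape(-1, order="F")

def commutation_matrix(m, n):
    """P(m, n) with vec(X.T) = P(m, n) @ vec(X) for any m-by-n X."""
    P = np.zeros((m * n, m * n))
    for i in range(m):
        for j in range(n):
            P[i * n + j, j * m + i] = 1.0
    return P

def solve_transpose_equation(A, B, C):
    """Solve X = A X^T B + C through (I - (B^T kron A) P(m,n)) vec(X) = vec(C)."""
    m, n = C.shape
    Gamma = np.kron(B.T, A) @ commutation_matrix(m, n)
    x = np.linalg.solve(np.eye(m * n) - Gamma, vec(C))
    return x.reshape((m, n), order="F")

rng = np.random.default_rng(4)
A = 0.5 * rng.standard_normal((2, 3))
B = 0.5 * rng.standard_normal((2, 3))
C = rng.standard_normal((2, 3))
X_vec = solve_transpose_equation(A, B, C)
```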



Theorem 1 Matrix equation (26) is solvable if and only if

$$W = AB^T W A^T B + AC^T B + C, \tag{30}$$

is solvable with unknown $W$. More specifically,

1. If $X$ is a solution of equation (26), then $W = X$ is a solution of equation (30).

2. If $W$ is the general solution of equation (30), then the general solution to equation (26) is

$$X = \frac{1}{2}\left(W + AW^T B\right) + \frac{1}{2} C. \tag{31}$$

3. Equation (26) has a unique solution $X^*$ if equation (30) has a unique solution $W^*$, namely,

$$\eta\gamma \neq 1, \quad \forall \eta, \gamma \in \lambda\{AB^T\}. \tag{32}$$

Moreover, $X^* = W^*$.
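Theorem 1 suggests a two-step procedure: solve the standard Stein equation (30), then map $W$ to $X$ through (31). The following sketch assumes NumPy and uses a dense Kronecker solve for (30) purely for illustration (any Stein solver could be substituted); the function names are hypothetical.

```python
import numpy as np

def solve_stein(A, B, C):
    """Solve the standard Stein equation W = A W B + C by vectorization."""
    m, n = C.shape
    K = np.eye(m * n) - np.kron(B.T, A)
    w = np.linalg.solve(K, C.reshape(-1, order="F"))
    return w.reshape((m, n), order="F")

def solve_via_theorem1(A, B, C):
    """Return (X, W): W solves (30) and X is obtained from W by formula (31)."""
    W = solve_stein(A @ B.T, A.T @ B, A @ C.T @ B + C)
    X = 0.5 * (W + A @ W.T @ B) + 0.5 * C
    return X, W

rng = np.random.default_rng(5)
A = 0.5 * rng.standard_normal((2, 3))
B = 0.5 * rng.standard_normal((2, 3))
C = rng.standard_normal((2, 3))
X, W = solve_via_theorem1(A, B, C)
```

In this generic example (30) has a unique solution, so by Item 3 the two matrices coincide; the mapping (31) only matters when (30) has more than one solution.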

Proof. Proofs of Item 1 and Item 2. By using (6), we know that

$$P_{(m,n)} (B^T \otimes A) P_{(m,n)} = A \otimes B^T. \tag{33}$$

With this, by taking $\mathrm{vec}(\cdot)$ on both sides of (30) and denoting $\omega = \mathrm{vec}(W)$, we get

$$\begin{aligned} \omega &= (B^T \otimes A)\, \mathrm{vec}(B^T W A^T) + \mathrm{vec}(AC^T B) + \mathrm{vec}(C) \\ &= (B^T \otimes A)(A \otimes B^T)\, \omega + \left(I_{mn} + (B^T \otimes A) P_{(m,n)}\right) \mathrm{vec}(C) \\ &= (B^T \otimes A) P_{(m,n)} (B^T \otimes A) P_{(m,n)}\, \omega + \left(I_{mn} + (B^T \otimes A) P_{(m,n)}\right) \mathrm{vec}(C) \\ &= \Gamma^2 \omega + (I_{mn} + \Gamma)\, \mathrm{vec}(C), \end{aligned}$$

where $\Gamma = (B^T \otimes A) P_{(m,n)}$. The above equation can also be written as

$$(I_{mn} - \Gamma^2)\, \omega = (I_{mn} + \Gamma)\, \mathrm{vec}(C). \tag{34}$$

Similarly, taking $\mathrm{vec}(\cdot)$ on both sides of (26) and denoting $x = \mathrm{vec}(X)$ produce

$$(I_{mn} - \Gamma)\, x = \mathrm{vec}(C), \tag{35}$$

and taking $\mathrm{vec}(\cdot)$ on both sides of expression (31) yields

$$\mathrm{vec}(X) = \frac{1}{2}(I_{mn} + \Gamma)\, \mathrm{vec}(W) + \frac{1}{2} \mathrm{vec}(C). \tag{36}$$

We notice that equation (34) is in the form of (15), equation (35) is in the form of (16) and expression
(36) is similar to (17). Then Item 1 and Item 2 follow from Proposition 1 directly.

Proof of Item 3. From the above proof and Proposition 1, we conclude that equation (26) has a unique
solution if equation (30) has a unique solution, which is equivalent to

$$1 \notin \lambda\left\{(A^T B)^T \otimes AB^T\right\} = \lambda\left\{B^T A \otimes AB^T\right\}. \tag{37}$$

It remains to show that (32) and (37) are equivalent. Without loss of generality, we assume that
$n \geq m$. Let $\lambda\{AB^T\} = \{\eta_1, \eta_2, \ldots, \eta_m\}$. Then

$$\lambda\{B^T A\} = \{\eta_1, \eta_2, \ldots, \eta_m, 0, 0, \ldots, 0\}.$$

As a result, we have

$$\lambda\left\{B^T A \otimes AB^T\right\} = \{\eta_i \eta_j, 0, 0, \ldots, 0\}, \quad i, j \in \{1, 2, \ldots, m\}.$$

Therefore, (37) is true if and only if (32) is satisfied. The proof is completed. ■



One may ask whether the condition in (32) is necessary for the uniqueness of the solution of equation (26). The
answer is negative, as the following example indicates.

Example 2 Consider a linear equation in the form of (26) with

$$A = \begin{bmatrix} 2 & 0 \\ 1 & a \end{bmatrix}, \quad B = \begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix}, \tag{38}$$

where $a$ is to be determined. By direct computation, the characteristic polynomial of $(B^T \otimes A) P_{(m,n)}$ is
$(s - 2)(s - a)(s^2 - 2a)$. Therefore, according to (29), the equation in (26) has a unique solution for
arbitrary $C$ if and only if $a \neq 1$ and $a \neq \frac{1}{2}$. On the other hand, as $\lambda\{B^T A\} = \{2, a\}$, the condition in (32)
is equivalent to $a \neq \pm 1$ and $a \neq \frac{1}{2}$. Hence, if $a = -1$, condition (32) is not satisfied while the equation (26)
has a unique solution. This clearly implies that (32) is only a sufficient but not a necessary condition for
the existence of a unique solution of equation (26) for arbitrary $C$.
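The gap between conditions (29) and (32) at $a = -1$ can be checked numerically. The sketch below assumes NumPy and reads the example data as $A = \begin{bmatrix} 2 & 0 \\ 1 & a \end{bmatrix}$, $B = I_2$, which is the reading consistent with the stated characteristic polynomial and with $\lambda\{B^T A\} = \{2, a\}$.

```python
import numpy as np

def commutation_matrix(m, n):
    """P(m, n) with vec(X.T) = P(m, n) @ vec(X) for any m-by-n X."""
    P = np.zeros((m * n, m * n))
    for i in range(m):
        for j in range(n):
            P[i * n + j, j * m + i] = 1.0
    return P

a = -1.0
A = np.array([[2.0, 0.0], [1.0, a]])
B = np.eye(2)

Gamma = np.kron(B.T, A) @ commutation_matrix(2, 2)
eig_Gamma = np.linalg.eigvals(Gamma)              # roots of (s-2)(s-a)(s^2-2a)

# Condition (29): 1 is not an eigenvalue of Gamma
cond_29 = not np.any(np.isclose(eig_Gamma, 1.0))

# Condition (32): eta * gamma != 1 for all eigenvalues eta, gamma of A B^T
eta = np.linalg.eigvals(A @ B.T)
cond_32 = not any(np.isclose(e * g, 1.0) for e in eta for g in eta)
```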



Though Theorem 1 only provides a sufficient condition, the advantage is that the condition in (32) is easier to
test than (29) because only the eigenvalues of a matrix of dimension $m$ are required.

Remark 1 If $n < m$, we need only to compute the eigenvalues of a matrix of dimension $n$ because condition
(32) can be replaced by $\eta\gamma \neq 1$, $\forall \eta, \gamma \in \lambda\{B^T A\}$.

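By combining Lemma 5 and Theorem 1, we have the following result regarding the closed-form solutions of
equation (26).

Theorem 2 Let $A \in \mathbb{C}^{m \times n}$, $B \in \mathbb{C}^{m \times n}$, $C \in \mathbb{C}^{m \times n}$ and

$$\tilde{h}_{AB^T}(s) = \det\left(I_m - sAB^T\right) = \alpha_m + \sum_{i=1}^{m} \alpha_{m-i} s^i, \quad \alpha_m = 1.$$

1. If equation (26) has a solution $X$, then

$$X \tilde{h}_{AB^T}(A^T B) = \sum_{k=1}^{m} \sum_{s=1}^{k} \alpha_k \left(AB^T\right)^{k-s} \left(AC^T B + C\right) \left(A^T B\right)^{m-s}.$$

2. If (32) is satisfied, then equation (26) has a unique solution $X^*$ given by

$$X^* = \left( \sum_{k=1}^{m} \sum_{s=1}^{k} \alpha_k \left(AB^T\right)^{k-s} \left(AC^T B + C\right) \left(A^T B\right)^{m-s} \right) \left( \tilde{h}_{AB^T}(A^T B) \right)^{-1}.$$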



4.2 Smith Iteration for Equation $X = AX^T B + C$

By recognizing the Smith iteration (10) for the standard Stein equation (9), we can construct the following
Smith iteration for matrix equation (26):

$$X_{k+1} = AX_k^T B + C, \quad \forall X_0 \in \mathbb{C}^{m \times n}. \tag{39}$$

In the following, we will study the convergence of this iteration. First, we introduce a lemma.

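Lemma 7 Let $A, B \in \mathbb{C}^{m \times n}$ be given and $\Gamma = (B^T \otimes A) P_{(m,n)}$. Then

$$\rho(\Gamma) = \rho(B^T A).$$

Proof. By definition of the spectral radius and using (33), we have

$$\begin{aligned} \rho^2(\Gamma) &= \rho(\Gamma^2) \\ &= \rho\left(\left((B^T \otimes A) P_{(m,n)}\right)\left((B^T \otimes A) P_{(m,n)}\right)\right) \\ &= \rho\left((B^T \otimes A)\left(P_{(m,n)} (B^T \otimes A) P_{(m,n)}\right)\right) \\ &= \rho\left((B^T \otimes A)(A \otimes B^T)\right) \\ &= \rho\left(B^T A \otimes AB^T\right) \\ &= \rho(B^T A)\, \rho(AB^T) \\ &= \rho^2(B^T A). \end{aligned}$$

The proof is completed. ■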

We then can prove the following result. 

Theorem 3 Assume that equation (26) has a unique solution $X^*$. Then the Smith iteration (39) converges
to $X^*$ for an arbitrary initial condition $X_0$ if and only if

$$\rho(B^T A) < 1. \tag{40}$$

Moreover, the asymptotic exponential convergence rate is $-\ln(\rho(B^T A))$.

Proof. Taking $\mathrm{vec}(\cdot)$ on both sides of (39) gives

$$\mathrm{vec}(X_{k+1}) = \Gamma\, \mathrm{vec}(X_k) + \mathrm{vec}(C),$$

which converges to a finite vector $\mathrm{vec}(X_{\infty})$ independent of the initial condition if and only if $\rho(\Gamma) < 1$. This
is equivalent to (40) in view of Lemma 7. Moreover, we have

$$X_{\infty} = AX_{\infty}^T B + C.$$

The proof is completed by noting the fact that equation (26) has only a unique solution under the condition
(40). ■
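The Smith iteration (39) and the spectral-radius identity of Lemma 7 can be tried out together. The sketch below assumes NumPy; the zero initial condition, the relative stopping rule and the rescaling of `A` so that $\rho(B^T A) = 0.8$ are illustrative choices, not prescriptions from the paper.

```python
import numpy as np

def commutation_matrix(m, n):
    """P(m, n) with vec(X.T) = P(m, n) @ vec(X) for any m-by-n X."""
    P = np.zeros((m * n, m * n))
    for i in range(m):
        for j in range(n):
            P[i * n + j, j * m + i] = 1.0
    return P

def smith_transpose(A, B, C, tol=1e-13, max_iter=10_000):
    """Iterate X_{k+1} = A X_k^T B + C starting from X_0 = 0."""
    X = np.zeros_like(C)
    for _ in range(max_iter):
        X_next = A @ X.T @ B + C
        if np.linalg.norm(X_next - X) <= tol * max(1.0, np.linalg.norm(X_next)):
            return X_next
        X = X_next
    return X

rng = np.random.default_rng(6)
A = rng.standard_normal((2, 3))
B = rng.standard_normal((2, 3))
C = rng.standard_normal((2, 3))
A *= 0.8 / max(abs(np.linalg.eigvals(B.T @ A)))     # now rho(B^T A) = 0.8

rho_BTA = max(abs(np.linalg.eigvals(B.T @ A)))
Gamma = np.kron(B.T, A) @ commutation_matrix(2, 3)
rho_Gamma = max(abs(np.linalg.eigvals(Gamma)))       # Lemma 7: equals rho(B^T A)

X_smith = smith_transpose(A, B, C)
```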



Notice that equation (30) is a standard Stein equation whose associated Smith iteration is

$$X_{k+1} = AB^T X_k A^T B + AC^T B + C, \quad \forall X_0 \in \mathbb{C}^{m \times n}. \tag{41}$$

It is trivial to show that this iteration converges to a constant matrix independent of the initial condition
$X_0$ if and only if (40) is satisfied. As (40) implies (32), we conclude from Theorem 1 that the iteration (41)
also converges to the unique solution $X^*$ of equation (26).



Remark 2 In view of (11), it is easy to see that iteration (41) can be regarded as the Smith(2) iteration
associated with the Smith iteration (39). Therefore, by using a similar technique as that used in (11), for
an arbitrary integer $l \geq 1$, we can construct the Smith($l$) iteration for equation (26). The details are omitted for
brevity.

Remark 3 Under condition (40), it follows from Theorem 1 that the unique solution of equation (26) is

$$X^* = \sum_{i=0}^{\infty} \left(AB^T\right)^i \left(AC^T B + C\right) \left(A^T B\right)^i.$$

Therefore, in view of Lemma 4, for any integer $r \geq 2$, we can construct the following $r$-Smith iteration

$$X_{k+1} = \sum_{i=0}^{r-1} A_k^i X_k B_k^i, \quad A_{k+1} = A_k^r, \quad B_{k+1} = B_k^r, \quad k \geq 0,$$

with $X_0 = AC^T B + C$, $A_0 = AB^T$ and $B_0 = A^T B$. Consequently, we have $\lim_{k \to \infty} X_k = X^*$. Actually, we
have

$$X_k = \sum_{i=0}^{r^k - 1} \left(AB^T\right)^i \left(AC^T B + C\right) \left(A^T B\right)^i.$$



5 Revisit of Equation $X = A\overline{X}B + C$

To study the matrix equation $X = AX^H B + C$, we should first revisit the following matrix equation

$$X = A\overline{X}B + C, \tag{42}$$

where $A \in \mathbb{C}^{m \times m}$, $B \in \mathbb{C}^{n \times n}$ and $C \in \mathbb{C}^{m \times n}$ are given, and $X \in \mathbb{C}^{m \times n}$ is to be determined. This equation
was first studied in [13]. To introduce the results obtained in that paper, we need some preliminaries. For a
complex matrix $A = A_1 + \mathrm{i} A_2$, where $\mathrm{i} = \sqrt{-1}$, the associated real representation is

$$A_{\sigma} = \begin{bmatrix} A_1 & A_2 \\ A_2 & -A_1 \end{bmatrix}. \tag{43}$$

The real representation has many good properties, one of which is the following.

Lemma 8 (Proposition 3.2 in [13]) Let $A \in \mathbb{C}^{m \times m}$. If $\alpha \in \lambda\{A_{\sigma}\}$, then $\{\pm\alpha, \pm\overline{\alpha}\} \subset \lambda\{A_{\sigma}\}$.

Based on the real representation, the following elegant result was established in [13] regarding solutions of
equation (42).

Proposition 2 ([13]) The equation (42) is solvable if and only if the following equation

$$Y = A_{\sigma} Y B_{\sigma} + C_{\sigma}, \tag{44}$$

is solvable with unknown $Y$. Moreover, let $Y$ be any real (see footnote 1) solution of (44); then the solution of equation (42)
is

$$X = \frac{1}{4} \begin{bmatrix} I_m & \mathrm{i} I_m \end{bmatrix} \left(Y + Q_m Y Q_n\right) \begin{bmatrix} I_n \\ \mathrm{i} I_n \end{bmatrix},$$

where

$$Q_s = \begin{bmatrix} 0 & I_s \\ -I_s & 0 \end{bmatrix}. \tag{45}$$

To use this result, we need to solve the linear equation (44), whose dimension is twice that of the original matrix
equation (42). In this section, by using the idea used in Section 4, we will give an alternative criterion for
the solvability and computation methods for equation (42).
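The real-representation route of Proposition 2 can be sketched as follows. This assumes NumPy, takes the coefficient in the recovery formula to be $\frac{1}{4}$, and solves (44) with a dense Kronecker system of dimension $4mn$ purely for illustration; the function names are hypothetical.

```python
import numpy as np

def real_rep_sigma(A):
    """Real representation (43): A_sigma = [[A1, A2], [A2, -A1]] for A = A1 + i A2."""
    return np.block([[A.real, A.imag], [A.imag, -A.real]])

def Q(s):
    """Q_s = [[0, I_s], [-I_s, 0]] from (45)."""
    return np.block([[np.zeros((s, s)), np.eye(s)], [-np.eye(s), np.zeros((s, s))]])

def solve_conj_equation_real_rep(A, B, C):
    """Solve X = A conj(X) B + C through the real Stein equation (44)."""
    m, n = C.shape
    As, Bs, Cs = real_rep_sigma(A), real_rep_sigma(B), real_rep_sigma(C)
    K = np.eye(4 * m * n) - np.kron(Bs.T, As)
    y = np.linalg.solve(K, Cs.reshape(-1, order="F"))
    Y = y.reshape((2 * m, 2 * n), order="F")
    left = np.hstack([np.eye(m), 1j * np.eye(m)])
    right = np.vstack([np.eye(n), 1j * np.eye(n)])
    return 0.25 * left @ (Y + Q(m) @ Y @ Q(n)) @ right

rng = np.random.default_rng(7)
A = 0.4 * (rng.standard_normal((2, 2)) + 1j * rng.standard_normal((2, 2)))
B = 0.4 * (rng.standard_normal((3, 3)) + 1j * rng.standard_normal((3, 3)))
C = rng.standard_normal((2, 3)) + 1j * rng.standard_normal((2, 3))
X_real_rep = solve_conj_equation_real_rep(A, B, C)
```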



5.1 Some Useful Results 



In this subsection, we give some useful results that will be used later. 



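Lemma 9 Let $A = A_1 + \mathrm{i} A_2 \in \mathbb{C}^{m \times n}$, where $A_1, A_2 \in \mathbb{R}^{m \times n}$, be any given complex matrix. Then

$$A_{\phi} \triangleq \begin{bmatrix} A_1 & -A_2 \\ A_2 & A_1 \end{bmatrix} = Z_m \begin{bmatrix} A & 0 \\ 0 & \overline{A} \end{bmatrix} Z_n^H, \tag{46}$$

where $Z_n$ is a unitary matrix defined as

$$Z_n = \frac{\sqrt{2}}{2} \begin{bmatrix} \mathrm{i} I_n & I_n \\ I_n & \mathrm{i} I_n \end{bmatrix}.$$

Moreover, for two arbitrary matrices $U \in \mathbb{C}^{m \times n}$ and $V \in \mathbb{C}^{n \times q}$, there holds

$$U_{\sigma} V_{\sigma} = \left(U \overline{V}\right)_{\phi}. \tag{47}$$

1 Though it is not mentioned in [13] that $Y$ is a real solution, it indeed should be so.

Proof. Equation (46) can be shown by direct manipulation. We next prove (47). By letting $U = U_1 + \mathrm{i} U_2$
and $V = V_1 + \mathrm{i} V_2$, where $U_i$ and $V_i$ are real matrices, we have

$$\begin{aligned} U_{\sigma} V_{\sigma} &= \begin{bmatrix} U_1 & U_2 \\ U_2 & -U_1 \end{bmatrix} \begin{bmatrix} V_1 & V_2 \\ V_2 & -V_1 \end{bmatrix} \\ &= \begin{bmatrix} U_1 V_1 + U_2 V_2 & U_1 V_2 - U_2 V_1 \\ U_2 V_1 - U_1 V_2 & U_1 V_1 + U_2 V_2 \end{bmatrix} \\ &= \left(U_1 V_1 + U_2 V_2 + \mathrm{i}\left(U_2 V_1 - U_1 V_2\right)\right)_{\phi} \\ &= \left((U_1 + \mathrm{i} U_2)(V_1 - \mathrm{i} V_2)\right)_{\phi}, \end{aligned}$$

which is (47). The proof is finished. ■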
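Both identities of Lemma 9 are easy to verify on random data. The sketch below assumes NumPy and takes $Z_n = \frac{\sqrt{2}}{2}\begin{bmatrix} \mathrm{i} I_n & I_n \\ I_n & \mathrm{i} I_n \end{bmatrix}$, which is an assumption about the definition of $Z_n$ (chosen because it makes (46) hold and matches the later use of (46) in the rank computation).

```python
import numpy as np

def real_rep_sigma(A):
    """A_sigma = [[A1, A2], [A2, -A1]]."""
    return np.block([[A.real, A.imag], [A.imag, -A.real]])

def real_rep_phi(A):
    """A_phi = [[A1, -A2], [A2, A1]]."""
    return np.block([[A.real, -A.imag], [A.imag, A.real]])

def Z(n):
    """Assumed unitary matrix Z_n = (sqrt(2)/2) [[i I, I], [I, i I]]."""
    I = np.eye(n)
    return (np.sqrt(2) / 2) * np.block([[1j * I, I], [I, 1j * I]])

rng = np.random.default_rng(8)
m, n, q = 2, 3, 4
U = rng.standard_normal((m, n)) + 1j * rng.standard_normal((m, n))
V = rng.standard_normal((n, q)) + 1j * rng.standard_normal((n, q))

zeros = np.zeros((m, n))
block_diag = np.block([[U, zeros], [zeros, U.conj()]])

z_is_unitary = np.allclose(Z(n) @ Z(n).conj().T, np.eye(2 * n))
eq46_ok = np.allclose(real_rep_phi(U), Z(m) @ block_diag @ Z(n).conj().T)
eq47_ok = np.allclose(real_rep_sigma(U) @ real_rep_sigma(V), real_rep_phi(U @ V.conj()))
```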



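Let $A = A_1 + \mathrm{i} A_2 \in \mathbb{C}^{m \times n}$ be any given matrix, where $A_1$ and $A_2$ are real matrices. Then the linear mapping
$\varphi(A) : \mathbb{C}^{m \times n} \to \mathbb{R}^{2mn}$ is defined as

$$\varphi(A) = \begin{bmatrix} \mathrm{vec}(A_1) \\ \mathrm{vec}(A_2) \end{bmatrix}.$$

We notice that the mapping $\varphi(\cdot)$ is bijective and denote

$$\varphi^{-1}(c) = \mathrm{vec}^{-1}(c_1) + \mathrm{i}\, \mathrm{vec}^{-1}(c_2), \quad c = \begin{bmatrix} c_1 \\ c_2 \end{bmatrix} \in \mathbb{R}^{2mn}, \quad c_i \in \mathbb{R}^{mn}.$$

Here we have assumed that $\mathrm{vec}^{-1}(\cdot)$ is defined in association with $\mathrm{vec}(\cdot)$ in the obvious way. The following
lemma gives further properties of $\varphi(A)$.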



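Lemma 10 Let $A \in \mathbb{C}^{p \times m}$ and $B \in \mathbb{C}^{n \times q}$ be given. Then for any $X \in \mathbb{C}^{m \times n}$, there hold

$$\varphi\left(A\overline{X}B\right) = \left(B^T \otimes A\right)_{\sigma} \varphi(X), \tag{48}$$

$$\varphi(AXB) = \left(B^T \otimes A\right)_{\phi} \varphi(X). \tag{49}$$

Proof. Let $A = A_1 + \mathrm{i} A_2$, $B = B_1 + \mathrm{i} B_2$, $X = X_1 + \mathrm{i} X_2$, where $A_i$, $B_i$ and $X_i$, $i = 1, 2$, are real matrices.
Then $Y = A\overline{X}B$, where $Y = Y_1 + \mathrm{i} Y_2 \in \mathbb{C}^{p \times q}$ with $Y_1$ and $Y_2$ being real matrices, is equivalent to

$$\begin{aligned} Y_1 &= (A_1 X_1 + A_2 X_2) B_1 - (A_2 X_1 - A_1 X_2) B_2, \\ Y_2 &= (A_2 X_1 - A_1 X_2) B_1 + (A_1 X_1 + A_2 X_2) B_2. \end{aligned}$$

Taking $\mathrm{vec}(\cdot)$ on both sides of the above two equations, we get $\varphi(Y) = T(A, B)\, \varphi(X)$, where

$$T(A, B) = \begin{bmatrix} B_1^T \otimes A_1 - B_2^T \otimes A_2 & B_1^T \otimes A_2 + B_2^T \otimes A_1 \\ B_1^T \otimes A_2 + B_2^T \otimes A_1 & B_2^T \otimes A_2 - B_1^T \otimes A_1 \end{bmatrix}.$$

In view of the real representation in (43), we can see that

$$\begin{aligned} T(A, B) &= \left(\left(B_1^T \otimes A_1 - B_2^T \otimes A_2\right) + \mathrm{i}\left(B_1^T \otimes A_2 + B_2^T \otimes A_1\right)\right)_{\sigma} \\ &= \left(\left(B_1^T + \mathrm{i} B_2^T\right) \otimes \left(A_1 + \mathrm{i} A_2\right)\right)_{\sigma} \\ &= \left(B^T \otimes A\right)_{\sigma}, \end{aligned} \tag{50}$$

which is (48).
Next we show (49). By using (47), (48) and by definition of $\varphi(\cdot)$, we have

$$\begin{aligned} \varphi(AXB) &= \left(B^T \otimes A\right)_{\sigma} \varphi\left(\overline{X}\right) \\ &= \left(B^T \otimes A\right)_{\sigma} \begin{bmatrix} I_{mn} & 0 \\ 0 & -I_{mn} \end{bmatrix} \varphi(X) \\ &= \left(B^T \otimes A\right)_{\sigma} \left(I_{mn}\right)_{\sigma} \varphi(X) \\ &= \left(\left(B^T \otimes A\right) \overline{I_{mn}}\right)_{\phi} \varphi(X) \\ &= \left(B^T \otimes A\right)_{\phi} \varphi(X). \end{aligned}$$

The proof is thus finished. ■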
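Relations (48) and (49) can be confirmed numerically. The sketch below assumes NumPy; the helper `varphi` stacks the column-major vectorizations of the real and imaginary parts, and all matrix sizes are arbitrary illustrative choices.

```python
import numpy as np

def vec(M):
    return M.reshape(-1, order="F")

def varphi(X):
    """The mapping X = X1 + i X2 -> [vec(X1); vec(X2)] in R^(2mn)."""
    return np.concatenate([vec(X.real), vec(X.imag)])

def real_rep_sigma(A):
    return np.block([[A.real, A.imag], [A.imag, -A.real]])

def real_rep_phi(A):
    return np.block([[A.real, -A.imag], [A.imag, A.real]])

rng = np.random.default_rng(9)
p, m, n, q = 2, 3, 2, 4
A = rng.standard_normal((p, m)) + 1j * rng.standard_normal((p, m))
B = rng.standard_normal((n, q)) + 1j * rng.standard_normal((n, q))
X = rng.standard_normal((m, n)) + 1j * rng.standard_normal((m, n))

E = np.kron(B.T, A)
eq48_ok = np.allclose(varphi(A @ X.conj() @ B), real_rep_sigma(E) @ varphi(X))
eq49_ok = np.allclose(varphi(A @ X @ B), real_rep_phi(E) @ varphi(X))
```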

The following lemma regarding properties of real representation is important in our further development. 



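Lemma 11 Let $A = A_1 + \mathrm{i} A_2 \in \mathbb{C}^{m \times m}$, where $A_1$ and $A_2$ are real matrices. Then

$$\mathrm{rank}(A_{\sigma}) = 2\, \mathrm{rank}(A), \tag{51}$$

$$\mathrm{rank}(I_{2m} - A_{\sigma}) = m + \mathrm{rank}\left(I_m - A\overline{A}\right). \tag{52}$$

Proof. We first show equation (51). By using Lemma 9, we have

$$\begin{aligned} \mathrm{rank}(A_{\sigma}) &= \mathrm{rank}\left(A_{\sigma} (I_m)_{\sigma}\right) = \mathrm{rank}(A_{\phi}) \\ &= \mathrm{rank} \begin{bmatrix} A & 0 \\ 0 & \overline{A} \end{bmatrix} \\ &= \mathrm{rank}(A) + \mathrm{rank}\left(\overline{A}\right) \\ &= 2\, \mathrm{rank}(A). \end{aligned}$$

We next show (52). Invoking (47) and (46), we obtain

$$\begin{aligned} \mathrm{rank}(I_{2m} - A_{\sigma}) &= \mathrm{rank}\left((I_m)_{\sigma} - A_{\sigma} (I_m)_{\sigma}\right) \\ &= \mathrm{rank}\left((I_m)_{\sigma} - A_{\phi}\right) \\ &= \mathrm{rank}\left(Z_m^H \left((I_m)_{\sigma} - A_{\phi}\right) Z_m\right) \\ &= \mathrm{rank}\left(Z_m^H (I_m)_{\sigma} Z_m - \begin{bmatrix} A & 0 \\ 0 & \overline{A} \end{bmatrix}\right) \\ &= \mathrm{rank} \begin{bmatrix} -A & -\mathrm{i} I_m \\ \mathrm{i} I_m & -\overline{A} \end{bmatrix} \\ &= \mathrm{rank}\left(\begin{bmatrix} -A & -\mathrm{i} I_m \\ \mathrm{i} I_m & -\overline{A} \end{bmatrix} \begin{bmatrix} 0 & -\mathrm{i} I_m \\ I_m & A \end{bmatrix}\right) \\ &= \mathrm{rank} \begin{bmatrix} -\mathrm{i} I_m & 0 \\ -\overline{A} & I_m - \overline{A} A \end{bmatrix} \\ &= m + \mathrm{rank}\left(I_m - A\overline{A}\right), \end{aligned}$$

where the last step uses $\mathrm{rank}(I_m - \overline{A}A) = \mathrm{rank}(I_m - A\overline{A})$, the two matrices being complex conjugates of each other.
The proof is finished. ■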



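5.2 Solvability of Equation $X = A\overline{X}B + C$

Based on linear equation theory, to study the inhomogeneous matrix equation (42), we should study the
homogeneous matrix equation $X = A\overline{X}B$. Associated with this equation, we define a set $\mathcal{S}' \subset \mathbb{C}^{m \times n}$ as

$$\mathcal{S}' = \left\{X \in \mathbb{C}^{m \times n} : X = A\overline{X}B\right\}.$$

For any $X_1 \in \mathcal{S}'$ and $X_2 \in \mathcal{S}'$ and any $\alpha \in \mathbb{C}$, we have

$$(X_1 + \alpha X_2) - A\left(\overline{X_1 + \alpha X_2}\right) B = \alpha X_2 - \overline{\alpha} A \overline{X_2} B,$$

which is not zero in general, namely, $X_1 + \alpha X_2 \notin \mathcal{S}'$ in general. Therefore, $\mathcal{S}'$ is not a linear subspace
contained in $\mathbb{C}^{m \times n}$ over the field $\mathbb{C}$. Alternatively, if we define a real set $\mathcal{S} \subset \mathbb{R}^{2mn}$ as

$$\mathcal{S} = \left\{\varphi(X) : X = A\overline{X}B,\ X \in \mathbb{C}^{m \times n}\right\},$$

then by using the fact that the mapping $\varphi(\cdot)$ is bijective and (48), we have

$$\begin{aligned} \mathcal{S} &= \left\{\varphi(X) : \varphi(X) = \varphi\left(A\overline{X}B\right),\ X \in \mathbb{C}^{m \times n}\right\} \\ &= \left\{\varphi(X) : \varphi(X) = \left(B^T \otimes A\right)_{\sigma} \varphi(X),\ X \in \mathbb{C}^{m \times n}\right\} \\ &= \left\{x : x = \left(B^T \otimes A\right)_{\sigma} x,\ x \in \mathbb{R}^{2mn}\right\} \\ &= \left\{x : \left(I_{2mn} - \left(B^T \otimes A\right)_{\sigma}\right) x = 0,\ x \in \mathbb{R}^{2mn}\right\}, \end{aligned} \tag{53}$$

which is clearly a linear subspace contained in $\mathbb{R}^{2mn}$ over the field $\mathbb{R}$. Moreover, we can see from (53) that

$$\dim(\mathcal{S}) = 2mn - \mathrm{rank}\left(I_{2mn} - \left(B^T \otimes A\right)_{\sigma}\right), \tag{54}$$

where $\dim(\mathcal{S})$ denotes the degree of freedom (measured by the maximal number of real parameters) in the
solution. Therefore, $\mathcal{S}$ is more suitable for characterizing the general solutions of equation (42). In fact, if
we let

$$\mathcal{X} = \left\{X \in \mathbb{C}^{m \times n} : X = A\overline{X}B + C\right\},$$

and $X^{\dagger}$ be any particular solution of equation (42), then

$$\mathcal{X} = \left\{X^{\dagger} + \varphi^{-1}(x) : \forall x \in \mathcal{S}\right\}.$$

To present our main result, we need an auxiliary result.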



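Lemma 12 Equation (42) has a unique solution for arbitrary $C$ if and only if

$$\eta\gamma \neq 1, \quad \forall \eta \in \lambda\left\{A\overline{A}\right\},\ \forall \gamma \in \lambda\left\{\overline{B}B\right\}. \tag{55}$$

Proof. Taking $\varphi(\cdot)$ on both sides of (42) and using (48) give

$$\left(I_{2mn} - \left(B^T \otimes A\right)_{\sigma}\right) \varphi(X) = \varphi(C). \tag{56}$$

Therefore, equation (42) has a unique solution for arbitrary $C$ if and only if

$$\mathrm{rank}\left(I_{2mn} - \left(B^T \otimes A\right)_{\sigma}\right) = 2mn. \tag{57}$$

On the other hand, in view of Lemma 11, we can see that

$$\begin{aligned} \mathrm{rank}\left(I_{2mn} - \left(B^T \otimes A\right)_{\sigma}\right) &= mn + \mathrm{rank}\left(I_{mn} - \left(B^T \otimes A\right) \overline{\left(B^T \otimes A\right)}\right) \\ &= mn + \mathrm{rank}\left(I_{mn} - \left(\overline{B}B\right)^T \otimes A\overline{A}\right). \end{aligned} \tag{58}$$

Hence, (57) is true if and only if

$$\mathrm{rank}\left(I_{mn} - \left(\overline{B}B\right)^T \otimes A\overline{A}\right) = mn,$$

or, equivalently, $1 \notin \lambda\left\{\left(\overline{B}B\right)^T \otimes A\overline{A}\right\}$. This is further equivalent to (55). The proof is completed. ■

We then can give the following alternative conditions regarding the solvability of equation (42), which can be
regarded as the generalization of Theorem 1 to equation (42).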



Theorem 4 Matrix equation \4>ty * s solvable if and only if the following equation 

W = AAWBB + ACB + C, (59) 
is solvable with unknown W . More specifically: 

1. If X is a solution of equation (Ul|), then W — X is a solution of equation i59\) . 

2. If W is the general closed-form solution of equation &59\) . then the general closed-form solution of 
equation is given by 

X = - (W + AWB + C) . (60) 



3. Equation G5p has a unique solution X* if and only if equation S59\) has a unique solution W* . More- 
over, X* = W*. 
Proof. Proofs of Item 1 and Item 2. Taking $\varphi(\cdot)$ on both sides of (59), using Lemma 10 and denoting
$E = B^{\mathrm T}\otimes A$, we get

\[
\begin{aligned}
\varphi(W) &= \varphi\left(A\overline{A}W\overline{B}B\right) + \varphi\left(A\overline{C}B\right) + \varphi(C) \\
&= E_{\sigma}\varphi\left(A\overline{W}B\right) + \left(I_{2mn} + E_{\sigma}\right)\varphi(C) \\
&= E_{\sigma}^{2}\varphi(W) + \left(I_{2mn} + E_{\sigma}\right)\varphi(C),
\end{aligned}
\]

which is further equivalent to

\[ \left(I_{2mn} - E_{\sigma}^{2}\right)\varphi(W) = \left(I_{2mn} + E_{\sigma}\right)\varphi(C). \tag{61} \]

Similarly, equation (42) is equivalent to (56), which can be rewritten as

\[ \left(I_{2mn} - E_{\sigma}\right)\varphi(X) = \varphi(C), \tag{62} \]

and the expression in (60) is equivalent to

\[ \varphi(X) = \frac{1}{2}\left(I_{2mn} + E_{\sigma}\right)\varphi(W) + \frac{1}{2}\varphi(C). \tag{63} \]

Since equation (61) is in the form of (15), equation (62) is in the form of (14) and expression (63) is in the
form of (17), Item 1 and Item 2 follow from Proposition 1 directly.

Proof of Item 3. Notice that, by using Lemma 12, equation (42) has a unique solution $X^{\star}$ if and only if
(55) is satisfied, which is also the necessary and sufficient condition for the existence of a unique solution of
equation (59). The proof is finished. ■
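As a numerical illustration of Theorem 4, the following minimal sketch solves the auxiliary Stein equation (59) by vectorization and then forms $X$ through (60). The helper name `solve_stein`, the random test data and the use of NumPy are assumptions made here for illustration only; they are not part of the original development.

```python
import numpy as np

def solve_stein(F, G, D):
    """Solve W = F W G + D using vec(F W G) = (G^T kron F) vec(W)."""
    m, n = D.shape
    K = np.eye(m * n) - np.kron(G.T, F)
    w = np.linalg.solve(K, D.flatten(order="F"))   # column-major vec
    return w.reshape((m, n), order="F")

# Hypothetical random data, for illustration only.
rng = np.random.default_rng(0)
m, n = 3, 2
A = 0.4 * (rng.standard_normal((m, m)) + 1j * rng.standard_normal((m, m)))
B = 0.4 * (rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n)))
C = rng.standard_normal((m, n)) + 1j * rng.standard_normal((m, n))

# Auxiliary Stein equation (59): W = (A conj(A)) W (conj(B) B) + A conj(C) B + C
W = solve_stein(A @ A.conj(), B.conj() @ B, A @ C.conj() @ B + C)

# Formula (60): X = (W + A conj(W) B + C) / 2
X = 0.5 * (W + A @ W.conj() @ B + C)

# Residual of the original equation (42): X = A conj(X) B + C
residual = np.linalg.norm(X - (A @ X.conj() @ B + C))
print(residual)
```

For generic data the auxiliary equation is uniquely solvable, in which case the computed $X$ also coincides with $W$, in line with Item 3.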

Remark 4 The difference between Theorem 1 and Theorem 4 is that condition (55) in Theorem 4 is necessary
and sufficient for the existence of a unique solution of equation (42).

Remark 5 Different from Proposition 2, the solvability condition for equation (42) in Theorem 4 is related
to the solvability of equation (59), which has the same dimension as (42). Also, different from Proposition
2, we have given in Theorem 4 a necessary and sufficient condition for the existence of a unique solution of
equation (42).

The following corollary is then a consequence of Theorem 4.

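Corollary 1 Equation (42) is solvable if and only if

\[ \operatorname{rank}\left(I_{mn} - (\overline{B}B)^{\mathrm T}\otimes A\overline{A}\right) = \operatorname{rank}\left(\left[\, I_{mn} - (\overline{B}B)^{\mathrm T}\otimes A\overline{A} \quad (B^{\mathrm T}\otimes A)\operatorname{vec}(\overline{C}) + \operatorname{vec}(C) \,\right]\right). \tag{64} \]

If the above relation is satisfied, then the degree of freedom (measured by the maximal number of free real
parameters), namely, $\dim(\mathscr{S})$, in the solution is given by

\[ \dim(\mathscr{S}) = mn - \operatorname{rank}\left(I_{mn} - (\overline{B}B)^{\mathrm T}\otimes A\overline{A}\right). \tag{65} \]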



5.3 Relationship Between Equations $X = A\overline{X}B + C$ and $Y = A_{\sigma}YB_{\sigma} + C_{\sigma}$

From Lemma 2 we can see easily that if equation (44) has a unique solution, then equation (42) must also
have a unique solution. But the converse is not clear. In this subsection, by using the results obtained in
the above two subsections, we are able to close this gap. Our result reveals a deep relationship between
the two equations (42) and (44).

Theorem 5 Equation (42) has a unique solution $X^{\star}$ if and only if equation (44) has a unique solution $Y^{\star}$.
Moreover, there holds $Y^{\star} = X^{\star}_{\sigma}$.
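Proof. Obviously, equation (44) has a unique solution for arbitrary $C_{\sigma}$ if and only if

\[ \alpha\beta \neq 1, \quad \forall \alpha \in \lambda\{A_{\sigma}\},\ \forall \beta \in \lambda\{B_{\sigma}\}. \tag{66} \]

For an arbitrary matrix $A = A_1 + \mathrm{i}A_2 \in \mathbf{C}^{m\times m}$ where $A_1, A_2 \in \mathbf{R}^{m\times m}$, we can compute

\[
\begin{aligned}
\det\left(sI_{2m} - A_{\sigma}\right) &= \det\begin{bmatrix} sI_m - A_1 & -A_2 \\ -A_2 & sI_m + A_1 \end{bmatrix} \\
&= \cdots \\
&= \mathrm{i}^{m}\det\left(-\mathrm{i}\left(s^{2}I_m - \overline{A}A\right)\right) \\
&= \det\left(s^{2}I_m - \overline{A}A\right) \\
&= \det\left(s^{2}I_m - A\overline{A}\right),
\end{aligned}
\tag{67}
\]

where we have used Lemma 3.

Let $\alpha$ and $\beta$ be two arbitrary elements in $\lambda\{A_{\sigma}\}$ and $\lambda\{B_{\sigma}\}$, respectively. Then it follows from Lemma 5
that $\{\pm\alpha, \pm\overline{\alpha}\} \subset \lambda\{A_{\sigma}\}$, $\{\pm\beta, \pm\overline{\beta}\} \subset \lambda\{B_{\sigma}\}$, and it follows from (67) that $\{\alpha^{2}, \overline{\alpha}^{2}\} \subset \lambda\{A\overline{A}\}$, $\{\beta^{2}, \overline{\beta}^{2}\} \subset \lambda\{\overline{B}B\}$. Direct manipulation shows that

\[ st \neq 1, \quad \forall s \in \{\pm\alpha, \pm\overline{\alpha}\} \subset \lambda\{A_{\sigma}\},\ \forall t \in \{\pm\beta, \pm\overline{\beta}\} \subset \lambda\{B_{\sigma}\}, \]

if and only if $\alpha\beta \neq \pm 1$ and $\alpha\overline{\beta} \neq \pm 1$. Also, direct computation indicates that

\[ st \neq 1, \quad \forall s \in \{\alpha^{2}, \overline{\alpha}^{2}\} \subset \lambda\{A\overline{A}\},\ \forall t \in \{\beta^{2}, \overline{\beta}^{2}\} \subset \lambda\{\overline{B}B\}, \]

if and only if $\alpha\beta \neq \pm 1$ and $\alpha\overline{\beta} \neq \pm 1$. According to the arbitrariness of $\alpha$ and $\beta$, we conclude from the above
that (55) and (66) are equivalent.

Finally, it is trivial to show that $Y^{\star} = X^{\star}_{\sigma}$. The proof is completed. ■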
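The determinant identity (67) can be checked numerically. The sketch below assumes the real representation $A_{\sigma} = \begin{bmatrix} A_1 & A_2 \\ A_2 & -A_1 \end{bmatrix}$, which matches the block structure used later in the proof of Theorem 8; the function name `real_rep` and the test values are hypothetical.

```python
import numpy as np

def real_rep(A):
    """Real representation A_sigma = [[A1, A2], [A2, -A1]] of A = A1 + i*A2 (assumed form)."""
    A1, A2 = A.real, A.imag
    return np.block([[A1, A2], [A2, -A1]])

# Hypothetical random test matrix and evaluation point.
rng = np.random.default_rng(1)
m = 4
A = rng.standard_normal((m, m)) + 1j * rng.standard_normal((m, m))
s = 0.7 + 0.3j

# Both sides of identity (67), evaluated at one complex value of s.
lhs = np.linalg.det(s * np.eye(2 * m) - real_rep(A))
rhs = np.linalg.det(s**2 * np.eye(m) - A @ A.conj())
print(lhs, rhs)
```

Since both sides are polynomials in $s$, agreement at a handful of random points is a reasonable sanity check, although it is of course not a proof.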

Proposition 2 and Theorem 5 are still unable to answer the following question: If $Y$ is the general real
solution of equation (44), then is the following

\[ X' \triangleq \frac{1}{4}\begin{bmatrix} I_m & \mathrm{i}I_m \end{bmatrix}\left(Y + Q_mYQ_n\right)\begin{bmatrix} I_n \\ \mathrm{i}I_n \end{bmatrix} \tag{68} \]

the general solution of equation (42)? The following example may indicate that the answer is positive.
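Example 3 Consider a linear equation in the form of (42) with

\[ A = \begin{bmatrix} 2 & 0 \\ 0 & \mathrm{i} \end{bmatrix}, \quad B = 1, \quad C = \begin{bmatrix} C_1 \\ C_2 \end{bmatrix} = \begin{bmatrix} C_{11} + \mathrm{i}C_{12} \\ C_{21} + \mathrm{i}C_{22} \end{bmatrix}, \]

where $C_{ij}$, $i, j = 1, 2$, are real scalars. Then equation (59) can be written as

\[ W = \begin{bmatrix} w_1 \\ w_2 \end{bmatrix} = A\overline{A}W\overline{B}B + A\overline{C}B + C = \begin{bmatrix} 4w_1 + 2\overline{C}_1 + C_1 \\ w_2 + \mathrm{i}\overline{C}_2 + C_2 \end{bmatrix}, \]

which is solvable if and only if $\mathrm{i}\overline{C}_2 + C_2 = 0$. Consequently, the general solution of equation (59) is

\[ W = \begin{bmatrix} -\frac{1}{3}\left(2\overline{C}_1 + C_1\right) \\ t \end{bmatrix}, \quad \forall t = t_1 + \mathrm{i}t_2 \in \mathbf{C}, \]

where $t_1$ and $t_2$ are real scalars. Hence, according to Theorem 4, the general solution of equation (42) is
given by

\[ X = \frac{1}{2}\left(W + A\overline{W}B + C\right) = \begin{bmatrix} -\frac{1}{3}\left(C_1 + 2\overline{C}_1\right) \\ \frac{1}{2}(1 + \mathrm{i})(t_1 + t_2) + \frac{1}{2}C_2 \end{bmatrix}. \]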
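On the other hand, direct computation shows that the general real solution of equation (44) is

\[ Y = \begin{bmatrix} -C_{11} & \frac{1}{3}C_{12} \\ s_1 + \frac{1}{2}C_{21} & s_2 + \frac{1}{2}C_{22} \\ \frac{1}{3}C_{12} & C_{11} \\ s_1 - \frac{1}{2}C_{21} & -s_2 + \frac{1}{2}C_{22} \end{bmatrix}, \quad \forall s_1, s_2 \in \mathbf{R}. \]

Then $X'$ defined in (68) can be computed as

\[ X' = \begin{bmatrix} -\frac{1}{3}\left(C_1 + 2\overline{C}_1\right) \\ \frac{1}{2}(1 + \mathrm{i})s + \frac{1}{2}C_2 \end{bmatrix}, \quad s \triangleq s_1 + s_2. \]

It is clear that $X' = X$ if we set $t_1 + t_2 = s$ in $X$.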

However, at present we cannot prove the correctness of the above result in general. This would be a conjecture
that needs further study.

Conjecture 1 If $Y$ is the general real solution of equation (44), then the general solution of equation (42)
is $X'$ defined in (68).

We should point out that even if the above conjecture is true, compared with the results in Theorem 4,
using solutions of equation (44) to construct solutions of equation (42) has no advantage, since the dimensions
of equation (44) are double those of equation (59) used in Theorem 4.

5.4 Closed-Form and Iterative Solutions of Equation $X = A\overline{X}B + C$

By combining Lemma 3 and Theorem 4 we can derive the following corollary regarding closed-form solutions
of equation (42). We notice that this result is exactly Theorem 4.4 in [13], where the real representation
method is developed to derive such a result. In contrast, in this paper the result is only a consequence of
Theorem 4 and Lemma 3, without using the intricate properties of the real representation.

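Corollary 2 Let $A \in \mathbf{C}^{m\times m}$, $B \in \mathbf{C}^{n\times n}$, $C \in \mathbf{C}^{m\times n}$ be given and

\[ h_{A\overline{A}}(s) = \det\left(I_m - sA\overline{A}\right) = \alpha_m + \sum_{i=1}^{m}\alpha_{m-i}s^{i}, \quad \alpha_m = 1. \]

1. If equation (42) has a solution $X$, then

\[ X h_{A\overline{A}}\left(\overline{B}B\right) = \sum_{k=1}^{m}\sum_{s=1}^{k}\alpha_k\left(A\overline{A}\right)^{k-s}\left(A\overline{C}B + C\right)\left(\overline{B}B\right)^{m-s}. \]

2. If equation (42) has a unique solution $X^{\star}$, then

\[ X^{\star} = \left(\sum_{k=1}^{m}\sum_{s=1}^{k}\alpha_k\left(A\overline{A}\right)^{k-s}\left(A\overline{C}B + C\right)\left(\overline{B}B\right)^{m-s}\right)\left(h_{A\overline{A}}\left(\overline{B}B\right)\right)^{-1}. \]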
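The closed-form expression in Item 2 of Corollary 2 can be evaluated directly. The sketch below is only an illustration under stated assumptions: the data are random, and the coefficients $\alpha_k$ are read off from `numpy.poly`, using the fact that $\det(I_m - sF) = \sum_{i=0}^{m}\alpha_{m-i}s^{i}$ when $\alpha_k$ is the coefficient of $\lambda^{k}$ in $\det(\lambda I_m - F)$.

```python
import numpy as np
from numpy.linalg import matrix_power as mpow

# Hypothetical random data, for illustration only.
rng = np.random.default_rng(2)
m, n = 3, 2
A = 0.5 * (rng.standard_normal((m, m)) + 1j * rng.standard_normal((m, m)))
B = 0.5 * (rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n)))
C = rng.standard_normal((m, n)) + 1j * rng.standard_normal((m, n))

F = A @ A.conj()              # A * conj(A)
G = B.conj() @ B              # conj(B) * B
D = A @ C.conj() @ B + C      # A * conj(C) * B + C

# alpha[k] multiplies lambda^k in det(lambda*I - F); alpha[m] = 1.
alpha = np.poly(F)[::-1]

# h(G) = sum_{i=0}^{m} alpha_{m-i} G^i
h_G = sum(alpha[m - i] * mpow(G, i) for i in range(m + 1))

# Double sum in Corollary 2
S = sum(alpha[k] * mpow(F, k - s) @ D @ mpow(G, m - s)
        for k in range(1, m + 1) for s in range(1, k + 1))

X = S @ np.linalg.inv(h_G)
residual = np.linalg.norm(X - (A @ X.conj() @ B + C))
print(residual)
```

For larger $m$ the powers of $A\overline{A}$ and the explicit inverse may become ill-conditioned, so this form is mainly of theoretical interest.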



Similar to the Smith iteration for equation (26), we can also construct the Smith iteration for equation (42)
as follows:

\[ X_{k+1} = A\overline{X}_kB + C, \quad \forall X_0 \in \mathbf{C}^{m\times n}. \tag{69} \]

The following theorem is concerned with the convergence of this iteration.
Theorem 6 The Smith iteration (69) converges to the unique solution $X^{\star}$ of equation (42) for arbitrary
initial condition $X_0$ if and only if

\[ \rho\left(A\overline{A}\right)\rho\left(\overline{B}B\right) < 1. \tag{70} \]

Moreover, the asymptotic exponential convergence rate is $-\ln\left(\rho\left(A\overline{A}\right)\rho\left(\overline{B}B\right)\right)$.

Proof. We first show that if the iteration (69) converges, then (70) should be satisfied. As $\{X_k\}_{k=0}^{\infty}$
converges, we know that the subsequence $\{X_{2i}\}_{i=0}^{\infty}$ also converges. It is easy to verify that $\{X_{2i}\}_{i=0}^{\infty}$ is
generated by the following iteration

\[ X_{2(i+1)} = A\overline{A}X_{2i}\overline{B}B + A\overline{C}B + C, \quad \forall X_0 \in \mathbf{C}^{m\times n}. \]

Notice that the above iteration is a standard Smith iteration, which converges to a constant matrix independent
of the initial condition if and only if (70) is true.

We next show that if (70) is satisfied, then the iteration (69) must converge. Notice that the iteration (69)
can be equivalently rewritten as

\[ \left(X_{k+1}\right)_{\sigma} = A_{\sigma}\left(X_k\right)_{\sigma}B_{\sigma} + C_{\sigma}, \quad \forall \left(X_0\right)_{\sigma} \in \mathbf{R}^{2m\times 2n}. \tag{71} \]

From (67) we can see that

\[ \rho\left(A_{\sigma}\right)\rho\left(B_{\sigma}\right) = \sqrt{\rho\left(A\overline{A}\right)}\sqrt{\rho\left(\overline{B}B\right)} < 1, \]

which indicates that (71) converges to a constant matrix $\left(X_{\infty}\right)_{\sigma}$ independent of the initial condition, namely,
$\left(X_{\infty}\right)_{\sigma} = A_{\sigma}\left(X_{\infty}\right)_{\sigma}B_{\sigma} + C_{\sigma}$, or equivalently, $X_{\infty} = A\overline{X}_{\infty}B + C$. The proof is completed by observing that
equation (42) has a unique solution under the condition (70). ■



Remark 6 Under condition (70), it follows from Theorem 4 that the unique solution $X^{\star}$ to equation (42)
is also the unique solution of equation (59), namely,

\[ X^{\star} = \sum_{i=0}^{\infty}\left(A\overline{A}\right)^{i}\left(A\overline{C}B + C\right)\left(\overline{B}B\right)^{i}. \tag{72} \]

Therefore, similar to the earlier remarks on the Smith($l$) and $r$-Smith iterations, we can also construct the
Smith($l$) iteration and the $r$-Smith iteration for solving equation (42). The details are omitted due to limited space.
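The Smith iteration (69) is straightforward to implement. The following is a minimal sketch, assuming NumPy and hypothetical random data scaled so that condition (70) holds; the function names `smith_conjugate` and `spectral_radius` are illustrative choices, not names from the original text.

```python
import numpy as np

def smith_conjugate(A, B, C, tol=1e-13, max_iter=10_000):
    """Iterate X_{k+1} = A conj(X_k) B + C starting from X_0 = 0."""
    X = np.zeros_like(C)
    for k in range(1, max_iter + 1):
        X_next = A @ X.conj() @ B + C
        if np.linalg.norm(X_next - X) <= tol:
            return X_next, k
        X = X_next
    raise RuntimeError("Smith iteration did not converge")

def spectral_radius(M):
    return np.max(np.abs(np.linalg.eigvals(M)))

# Hypothetical data, scaled so that the spectral norms of A and B equal 0.8.
rng = np.random.default_rng(3)
m, n = 4, 3
A = rng.standard_normal((m, m)) + 1j * rng.standard_normal((m, m))
B = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))
C = rng.standard_normal((m, n)) + 1j * rng.standard_normal((m, n))
A *= 0.8 / np.linalg.norm(A, 2)
B *= 0.8 / np.linalg.norm(B, 2)

# Left-hand side of condition (70)
condition = spectral_radius(A @ A.conj()) * spectral_radius(B.conj() @ B)

X, steps = smith_conjugate(A, B, C)
residual = np.linalg.norm(X - (A @ X.conj() @ B + C))
print(condition, steps, residual)
```

The norm scaling is only a convenient way to guarantee (70) in this sketch; the theorem itself requires just the spectral radius condition.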



6 Equation $X = AX^{\mathrm H}B + C$

In this section, we study the following matrix equation:

\[ X = AX^{\mathrm H}B + C, \tag{73} \]

where $A \in \mathbf{C}^{m\times n}$, $B \in \mathbf{C}^{m\times n}$ and $C \in \mathbf{C}^{m\times n}$ are given, and $X \in \mathbf{C}^{m\times n}$ is unknown. In this section, we will
generalize the results for the equation $X = AX^{\mathrm T}B + C$ and for equation (42) to this equation.

6.1 Solvability of Equation $X = AX^{\mathrm H}B + C$

We first show a result parallel to Lemma 12.

Lemma 13 Equation (73) has a unique solution for arbitrary $C$ if and only if

\[ \eta\overline{\gamma} \neq 1, \quad \forall \eta, \gamma \in \lambda\{AB^{\mathrm H}\}. \tag{74} \]
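Proof. Notice that, for any complex matrix $X = X_1 + \mathrm{i}X_2 \in \mathbf{C}^{m\times n}$ with $X_1$ and $X_2$ real, we have

\[ \varphi\left(X^{\mathrm T}\right) = \begin{bmatrix} \operatorname{vec}\left(X_1^{\mathrm T}\right) \\ \operatorname{vec}\left(X_2^{\mathrm T}\right) \end{bmatrix} = \begin{bmatrix} P_{(m,n)}\operatorname{vec}(X_1) \\ P_{(m,n)}\operatorname{vec}(X_2) \end{bmatrix} = \mathcal{P}_{(m,n)}\varphi(X), \tag{75} \]

where

\[ \mathcal{P}_{(m,n)} = \begin{bmatrix} P_{(m,n)} & 0 \\ 0 & P_{(m,n)} \end{bmatrix}. \]

With this, by taking $\varphi(\cdot)$ on both sides of (73), using Lemma 10 and denoting $E = B^{\mathrm T}\otimes A$, we get

\[
\begin{aligned}
\varphi(X) &= \varphi\left(AX^{\mathrm H}B\right) + \varphi(C) \\
&= \left(B^{\mathrm T}\otimes A\right)_{\sigma}\varphi\left(X^{\mathrm T}\right) + \varphi(C) \\
&= E_{\sigma}\mathcal{P}_{(m,n)}\varphi(X) + \varphi(C).
\end{aligned}
\]

Therefore, equation (73) has a unique solution for arbitrary $C$ if and only if

\[ \operatorname{rank}\left(I_{2mn} - E_{\sigma}\mathcal{P}_{(m,n)}\right) = 2mn. \tag{76} \]

Similar to the proof of relation (52), we can compute

\[ \operatorname{rank}\left(I_{2mn} - E_{\sigma}\mathcal{P}_{(m,n)}\right) = \operatorname{rank}\left(\mathcal{P}_{(n,m)} - E_{\sigma}\right) = \cdots = mn + \operatorname{rank}\left(I_{mn} - EP_{(m,n)}\overline{E}P_{(m,n)}\right). \tag{78} \]

Notice that

\[
\begin{aligned}
EP_{(m,n)}\overline{E}P_{(m,n)} &= EP_{(m,n)}\overline{\left(B^{\mathrm T}\otimes A\right)}P_{(m,n)} \\
&= EP_{(m,n)}\left(B^{\mathrm H}\otimes\overline{A}\right)P_{(m,n)} \\
&= B^{\mathrm T}\overline{A}\otimes AB^{\mathrm H}.
\end{aligned}
\]

Hence it follows from (78) that condition (76) is true if and only if

\[ 1 \notin \lambda\left\{\left(A^{\mathrm H}B\right)^{\mathrm T}\otimes AB^{\mathrm H}\right\}, \tag{79} \]

which is equivalent to (74) because

\[
\begin{aligned}
\lambda\left\{\left(A^{\mathrm H}B\right)^{\mathrm T}\otimes AB^{\mathrm H}\right\} &= \lambda\left\{B^{\mathrm T}\overline{A}\otimes AB^{\mathrm H}\right\} \\
&= \lambda\left\{\overline{AB^{\mathrm H}}\otimes AB^{\mathrm H}\right\} \\
&= \left\{\eta\overline{\gamma},\ \forall \eta, \gamma \in \lambda\{AB^{\mathrm H}\}\right\}.
\end{aligned}
\]

The proof is completed. ■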

We are now able to present the results regarding the solvability of equation (73).

Theorem 7 Matrix equation (73) is solvable if and only if the following equation

\[ W = AB^{\mathrm H}WA^{\mathrm H}B + AC^{\mathrm H}B + C, \tag{80} \]

is solvable with unknown $W$. More specifically,

1. If $X$ is a solution of equation (73), then $W = X$ is also a solution of equation (80).

2. If $W$ is the general solution of equation (80), then the general solution to equation (73) is

\[ X = \frac{1}{2}\left(W + AW^{\mathrm H}B\right) + \frac{1}{2}C. \tag{81} \]

3. Equation (73) has a unique solution $X^{\star}$ if and only if equation (80) has a unique solution $W^{\star}$, namely,
(74) is satisfied. Moreover, $X^{\star} = W^{\star}$.

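Proof. Proofs of Item 1 and Item 2. For any matrices $A \in \mathbf{C}^{m\times n}$, $P \in \mathbf{R}^{p\times m}$ and $Q \in \mathbf{R}^{n\times q}$, it can be
readily verified that

\[ \left(PAQ\right)_{\sigma} = \begin{bmatrix} P & 0 \\ 0 & P \end{bmatrix} A_{\sigma} \begin{bmatrix} Q & 0 \\ 0 & Q \end{bmatrix}. \]

By using this fact, Lemma 1 and Lemma 10 we obtain

\[
\begin{aligned}
\varphi(W) &= \varphi\left(AB^{\mathrm H}WA^{\mathrm H}B\right) + \varphi\left(AC^{\mathrm H}B\right) + \varphi(C) \\
&= E_{\sigma}\varphi\left(B^{\mathrm T}\overline{W}A^{\mathrm T}\right) + E_{\sigma}\varphi\left(C^{\mathrm T}\right) + \varphi(C) \\
&= E_{\sigma}\left(A\otimes B^{\mathrm T}\right)_{\sigma}\varphi(W) + \left(I_{2mn} + E_{\sigma}\mathcal{P}_{(m,n)}\right)\varphi(C) \\
&= E_{\sigma}\left(P_{(m,n)}\left(B^{\mathrm T}\otimes A\right)P_{(m,n)}\right)_{\sigma}\varphi(W) + \left(I_{2mn} + E_{\sigma}\mathcal{P}_{(m,n)}\right)\varphi(C) \\
&= E_{\sigma}\mathcal{P}_{(m,n)}E_{\sigma}\mathcal{P}_{(m,n)}\varphi(W) + \left(I_{2mn} + E_{\sigma}\mathcal{P}_{(m,n)}\right)\varphi(C),
\end{aligned}
\]

where $E = B^{\mathrm T}\otimes A$ and $\mathcal{P}_{(m,n)}$ is defined in (75). It follows that

\[ \left(I_{2mn} - \left(E_{\sigma}\mathcal{P}_{(m,n)}\right)^{2}\right)\varphi(W) = \left(I_{2mn} + E_{\sigma}\mathcal{P}_{(m,n)}\right)\varphi(C). \]

Similarly, equation (73) and expression (81) are, respectively, equivalent to

\[ \left(I_{2mn} - E_{\sigma}\mathcal{P}_{(m,n)}\right)\varphi(X) = \varphi(C), \]

and

\[ \varphi(X) = \frac{1}{2}\left(I_{2mn} + E_{\sigma}\mathcal{P}_{(m,n)}\right)\varphi(W) + \frac{1}{2}\varphi(C). \]

The remainder of the proof is similar to the proof of Theorem 1 and is omitted for brevity.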
Proof of Item 3. Clearly, equation (80) has a unique solution if and only if (79) is satisfied. By using Lemma
13, (79) is equivalent to (74), which is just the condition for the existence of a unique solution of equation (73).
The remainder of the proof is similar to the proof of Item 3 of Theorem 1. The proof is done. ■

The advantage of Theorem 7 is that to solve equation (73), we need only consider the standard Stein
equation (80), which is again in the form of (9).
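To illustrate how Theorem 7 reduces equation (73) to a standard Stein equation, the sketch below solves (80) by vectorization and then applies (81). The helper `solve_stein` and the random data are assumptions made for this illustration.

```python
import numpy as np

def solve_stein(F, G, D):
    """Solve W = F W G + D using vec(F W G) = (G^T kron F) vec(W)."""
    m, n = D.shape
    K = np.eye(m * n) - np.kron(G.T, F)
    w = np.linalg.solve(K, D.flatten(order="F"))   # column-major vec
    return w.reshape((m, n), order="F")

# Hypothetical random data; A, B and C are all m-by-n as in (73).
rng = np.random.default_rng(4)
m, n = 3, 2
A = 0.4 * (rng.standard_normal((m, n)) + 1j * rng.standard_normal((m, n)))
B = 0.4 * (rng.standard_normal((m, n)) + 1j * rng.standard_normal((m, n)))
C = rng.standard_normal((m, n)) + 1j * rng.standard_normal((m, n))

AH, BH, CH = A.conj().T, B.conj().T, C.conj().T

# Auxiliary Stein equation (80): W = (A B^H) W (A^H B) + A C^H B + C
W = solve_stein(A @ BH, AH @ B, A @ CH @ B + C)

# Formula (81): X = (W + A W^H B) / 2 + C / 2
X = 0.5 * (W + A @ W.conj().T @ B) + 0.5 * C

# Residual of the original equation (73): X = A X^H B + C
residual = np.linalg.norm(X - (A @ X.conj().T @ B + C))
print(residual)
```

Note that the coefficient matrices $AB^{\mathrm H}$ and $A^{\mathrm H}B$ of the auxiliary equation are $m\times m$ and $n\times n$, so the linear system solved here has the same size $mn$ as the unknown.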



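Similarly to the set $\mathscr{S}$ defined in (53), if we let

\[ \mathscr{T} = \left\{ x : \left(I_{2mn} - E_{\sigma}\mathcal{P}_{(m,n)}\right)x = 0,\ x \in \mathbf{R}^{2mn\times 1} \right\}, \]

which is a linear subspace contained in $\mathbf{R}^{2mn\times 1}$ over the field $\mathbf{R}$, then we can obtain the following corollary
that parallels Corollary 1.

Corollary 3 Equation (73) is solvable if and only if

\[ \operatorname{rank}\left(I_{mn} - \left(A^{\mathrm H}B\right)^{\mathrm T}\otimes AB^{\mathrm H}\right) = \operatorname{rank}\left(\left[\, I_{mn} - \left(A^{\mathrm H}B\right)^{\mathrm T}\otimes AB^{\mathrm H} \quad \left(B^{\mathrm T}\otimes A\right)\operatorname{vec}\left(C^{\mathrm H}\right) + \operatorname{vec}(C) \,\right]\right). \]

If the above relation is satisfied, then the degree of freedom (measured by the maximal number of free real
parameters), namely, $\dim(\mathscr{T})$, in the solution is given by

\[ \dim(\mathscr{T}) = mn - \operatorname{rank}\left(I_{mn} - \left(A^{\mathrm H}B\right)^{\mathrm T}\otimes AB^{\mathrm H}\right). \]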
6.2 Solvability Conditions Based on Real Representation 

In this subsection, by using the real representation of complex matrices, we generalize the results in [13] (namely,
Proposition 2) to equation (73).

Theorem 8 Equation (73) is solvable if and only if equation

\[ Y = A_{\sigma}Y^{\mathrm T}B_{\sigma} + C_{\sigma}, \tag{82} \]

is solvable with unknown $Y$. More specifically,

1. If $X$ is a solution of equation (73), then $Y = X_{\sigma}$ is a solution of equation (82).

2. If $Y$ is a real solution of (82), then a solution of equation (73) is

\[ X = \frac{1}{4}\begin{bmatrix} I_m & \mathrm{i}I_m \end{bmatrix}\left(Y + Q_mYQ_n\right)\begin{bmatrix} I_n \\ \mathrm{i}I_n \end{bmatrix}. \]

3. Equation (73) has a unique solution $X^{\star}$ if equation (82) has a unique solution $Y^{\star}$, namely, $1 \notin \lambda\left\{\left(B_{\sigma}^{\mathrm T}\otimes A_{\sigma}\right)P_{(2m,2n)}\right\}$. Moreover, $Y^{\star} = X^{\star}_{\sigma}$.



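Proof. Proof of Item 1. Let $A = A_1 + \mathrm{i}A_2$, $B = B_1 + \mathrm{i}B_2$ and $C = C_1 + \mathrm{i}C_2$, where $A_i$, $B_i$ and $C_i$, $i = 1, 2$,
are real matrices. Then by substituting $X = X_1 + \mathrm{i}X_2$, where $X_1, X_2 \in \mathbf{R}^{m\times n}$, into (73), we obtain

\[
\begin{aligned}
X_1 &= \left(A_1X_1^{\mathrm T} + A_2X_2^{\mathrm T}\right)B_1 - \left(A_2X_1^{\mathrm T} - A_1X_2^{\mathrm T}\right)B_2 + C_1, \\
X_2 &= \left(A_2X_1^{\mathrm T} - A_1X_2^{\mathrm T}\right)B_1 + \left(A_1X_1^{\mathrm T} + A_2X_2^{\mathrm T}\right)B_2 + C_2,
\end{aligned}
\]

which is also equivalent to

\[ \begin{bmatrix} X_1 & X_2 \\ X_2 & -X_1 \end{bmatrix} = \begin{bmatrix} A_1 & A_2 \\ A_2 & -A_1 \end{bmatrix}\begin{bmatrix} X_1 & X_2 \\ X_2 & -X_1 \end{bmatrix}^{\mathrm T}\begin{bmatrix} B_1 & B_2 \\ B_2 & -B_1 \end{bmatrix} + \begin{bmatrix} C_1 & C_2 \\ C_2 & -C_1 \end{bmatrix}. \]

This indicates that $Y = X_{\sigma}$ satisfies (82) in view of the real representation (43).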
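Proof of Item 2. Let

\[ Y = \begin{bmatrix} Y_1 & Y_2 \\ Y_3 & Y_4 \end{bmatrix}, \quad Y_i \in \mathbf{R}^{m\times n},\ i = 1, 2, 3, 4, \]

be a real solution of equation (82). Note that $Q_mA_{\sigma}Q_n = A_{\sigma}$ ([13]) and

\[ Q_m^{-1} = \begin{bmatrix} 0 & I_m \\ -I_m & 0 \end{bmatrix}^{-1} = \begin{bmatrix} 0 & -I_m \\ I_m & 0 \end{bmatrix} = Q_m^{\mathrm T}. \]

Then by multiplying the left hand side and right hand side of (82) by $Q_m$ and $Q_n$, respectively, we obtain

\[
\begin{aligned}
Q_mYQ_n &= Q_mA_{\sigma}Y^{\mathrm T}B_{\sigma}Q_n + Q_mC_{\sigma}Q_n \\
&= Q_mA_{\sigma}Q_nQ_n^{-1}Y^{\mathrm T}Q_m^{-1}Q_mB_{\sigma}Q_n + Q_mC_{\sigma}Q_n \\
&= A_{\sigma}Q_n^{\mathrm T}Y^{\mathrm T}Q_m^{\mathrm T}B_{\sigma} + C_{\sigma} \\
&= A_{\sigma}\left(Q_mYQ_n\right)^{\mathrm T}B_{\sigma} + C_{\sigma}.
\end{aligned}
\]

This indicates that $Q_mYQ_n$ is also a solution of equation (82). Consequently,

\[ \hat{Y} \triangleq \frac{1}{2}\left(Y + Q_mYQ_n\right) = \frac{1}{2}\begin{bmatrix} Y_1 - Y_4 & Y_2 + Y_3 \\ Y_3 + Y_2 & -\left(Y_1 - Y_4\right) \end{bmatrix} \triangleq \begin{bmatrix} X_1 & X_2 \\ X_2 & -X_1 \end{bmatrix} \]

is also a solution of equation (82). Therefore, the following matrix

\[ X \triangleq X_1 + \mathrm{i}X_2 = \frac{1}{2}\begin{bmatrix} I_m & \mathrm{i}I_m \end{bmatrix}\hat{Y}\begin{bmatrix} I_n \\ \mathrm{i}I_n \end{bmatrix} \]

is a solution of equation (73).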



Proof of Item 3. We need only show that if equation (82) has a unique solution, then equation (73) also
has a unique solution. We show this by contradiction. Assume that equation (73) has at least two solutions,
say, $X^{1}$ and $X^{2}$ with $X^{1} \neq X^{2}$. Then, according to Item 1 of this theorem, both $X^{1}_{\sigma}$ and $X^{2}_{\sigma}$ are solutions of
equation (82). Clearly, we have $X^{1}_{\sigma} \neq X^{2}_{\sigma}$, which yields a contradiction. The proof is finished. ■
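The construction in Item 2 of Theorem 8 can be traced numerically. The sketch below is an illustration under stated assumptions: it uses the real representation $M_{\sigma} = \begin{bmatrix} M_1 & M_2 \\ M_2 & -M_1 \end{bmatrix}$ and $Q_k = \begin{bmatrix} 0 & I_k \\ -I_k & 0 \end{bmatrix}$ as in the proof above, random data, and a hypothetical helper `solve_real_linear` that assembles the matrix of a linear map column by column.

```python
import numpy as np

def real_rep(M):
    """Real representation [[M1, M2], [M2, -M1]] of M = M1 + i*M2."""
    return np.block([[M.real, M.imag], [M.imag, -M.real]])

def Q(k):
    I, Z = np.eye(k), np.zeros((k, k))
    return np.block([[Z, I], [-I, Z]])

def solve_real_linear(op, R):
    """Solve op(Y) = R for real Y by assembling the matrix of the linear map op."""
    p, q = R.shape
    M = np.empty((p * q, p * q))
    for k in range(p * q):
        E = np.zeros(p * q)
        E[k] = 1.0
        M[:, k] = op(E.reshape(p, q)).ravel()
    return np.linalg.solve(M, R.ravel()).reshape(p, q)

# Hypothetical random data; A, B and C are all m-by-n as in (73).
rng = np.random.default_rng(5)
m, n = 2, 3
A = 0.4 * (rng.standard_normal((m, n)) + 1j * rng.standard_normal((m, n)))
B = 0.4 * (rng.standard_normal((m, n)) + 1j * rng.standard_normal((m, n)))
C = rng.standard_normal((m, n)) + 1j * rng.standard_normal((m, n))
As, Bs, Cs = real_rep(A), real_rep(B), real_rep(C)

# Real equation (82): Y = A_sigma Y^T B_sigma + C_sigma
Y = solve_real_linear(lambda Z: Z - As @ Z.T @ Bs, Cs)

# Item 2 of Theorem 8: recover a complex solution of (73) from Y
left = np.hstack([np.eye(m), 1j * np.eye(m)])
right = np.vstack([np.eye(n), 1j * np.eye(n)])
X = 0.25 * left @ (Y + Q(m) @ Y @ Q(n)) @ right

residual = np.linalg.norm(X - (A @ X.conj().T @ B + C))
print(residual)
```

The real system has $4mn$ unknowns, compared with $mn$ complex unknowns for the Stein equation (80), which reflects the dimension doubling discussed in the text.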

The following example indicates that the converse of Item 3 of Theorem 8 may also be true.

Example 4 Consider a linear equation in the form of (73) with $A$ and $B$ given in (38). Then equation (82)
has a unique solution if and only if

\[ 1 \notin \lambda\left\{\left(B_{\sigma}^{\mathrm T}\otimes A_{\sigma}\right)P_{(2m,2n)}\right\} = \left\{\pm 2,\ \pm|\alpha|,\ \alpha,\ \overline{\alpha},\ \pm\sqrt{2\alpha},\ \pm\sqrt{2\overline{\alpha}}\right\}, \]

namely, $|\alpha| \neq 1$ and $\alpha \neq \frac{1}{2}$. On the other hand, since $\lambda\{AB^{\mathrm H}\} = \{2, \alpha\}$, the condition in (74) is also
equivalent to $|\alpha|^{2} \neq 1$ and $\alpha \neq \frac{1}{2}$. This implies, for $A$ and $B$ given in (38), that equation (82) has a unique
solution if and only if equation (73) has a unique solution.

However, we also cannot prove the above statement in general. We state it as a conjecture, which can be
regarded as the generalization of Theorem 5 to equations (73) and (82).

Conjecture 2 If equation (73) has a unique solution $X^{\star}$, then equation (82) has a unique solution $Y^{\star}$.
Moreover, there holds $Y^{\star} = X^{\star}_{\sigma}$.

Even if the above conjecture is true, compared with the results in Subsection 6.1 there is no advantage
in transforming equation (73) into equation (82) via its real representation.
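6.3 Closed-Form and Iterative Solutions of Equation $X = AX^{\mathrm H}B + C$

Based on Theorem 7 we can extend Corollary 2 to equation (73) without a proof.

Theorem 9 Let $A \in \mathbf{C}^{m\times n}$, $B \in \mathbf{C}^{m\times n}$, $C \in \mathbf{C}^{m\times n}$ and

\[ h_{AB^{\mathrm H}}(s) = \det\left(I_m - sAB^{\mathrm H}\right) = \alpha_m + \sum_{i=1}^{m}\alpha_{m-i}s^{i}, \quad \alpha_m = 1. \]

1. If equation (73) has a solution $X$, then

\[ X h_{AB^{\mathrm H}}\left(A^{\mathrm H}B\right) = \sum_{k=1}^{m}\sum_{s=1}^{k}\alpha_k\left(AB^{\mathrm H}\right)^{k-s}\left(AC^{\mathrm H}B + C\right)\left(A^{\mathrm H}B\right)^{m-s}. \]

2. If equation (73) has a unique solution $X^{\star}$, then

\[ X^{\star} = \left(\sum_{k=1}^{m}\sum_{s=1}^{k}\alpha_k\left(AB^{\mathrm H}\right)^{k-s}\left(AC^{\mathrm H}B + C\right)\left(A^{\mathrm H}B\right)^{m-s}\right)\left(h_{AB^{\mathrm H}}\left(A^{\mathrm H}B\right)\right)^{-1}. \]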

We next consider the Smith iteration associated with equation (73). Similar to (39), this iteration is in the
form of

\[ X_{k+1} = AX_k^{\mathrm H}B + C, \quad \forall X_0 \in \mathbf{C}^{m\times n}. \tag{83} \]

The following result is about the convergence of the above iteration.

Theorem 10 The Smith iteration (83) converges to the unique solution $X^{\star}$ of equation (73) for arbitrary
initial condition $X_0$ if and only if

\[ \rho\left(B^{\mathrm H}A\right) < 1. \tag{84} \]

Moreover, the asymptotic exponential convergence rate is $-\ln\left(\rho\left(B^{\mathrm H}A\right)\right)$.

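Proof. Similar to the proof of Theorem 6, we can show that if (83) converges, we must have (84). Therefore,
we need only show that if (84) is satisfied, the iteration in (83) must converge to the unique solution of
equation (73).

By using the real representation and Theorem 3, the iteration in (83) is equivalent to

\[ \left(X_{k+1}\right)_{\sigma} = A_{\sigma}\left(\left(X_k\right)_{\sigma}\right)^{\mathrm T}B_{\sigma} + C_{\sigma}. \tag{85} \]

According to Theorem 3, the above iteration converges to a constant matrix independent of the initial
condition if and only if $\rho\left(A_{\sigma}B_{\sigma}^{\mathrm T}\right) < 1$. According to Lemma 4, we have

\[
\begin{aligned}
\rho\left(A_{\sigma}B_{\sigma}^{\mathrm T}\right) &= \rho\left(\begin{bmatrix} AB^{\mathrm H} & 0 \\ 0 & \overline{AB^{\mathrm H}} \end{bmatrix}\right) \\
&= \max\left\{\rho\left(AB^{\mathrm H}\right), \rho\left(\overline{AB^{\mathrm H}}\right)\right\} \\
&= \rho\left(AB^{\mathrm H}\right).
\end{aligned}
\]

Hence, if (84) is satisfied, iteration (85) converges to a constant matrix $X_{\infty}$ satisfying $\left(X_{\infty}\right)_{\sigma} = A_{\sigma}\left(\left(X_{\infty}\right)_{\sigma}\right)^{\mathrm T}B_{\sigma} + C_{\sigma}$,
or equivalently, $X_{\infty} = AX_{\infty}^{\mathrm H}B + C$. The proof is completed by observing that equation (73) has a
unique solution under condition (84). ■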



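Remark 7 Under condition (84), the unique solution of equation (73) is also the unique solution of equation
(80), namely,

\[ X^{\star} = \sum_{i=0}^{\infty}\left(AB^{\mathrm H}\right)^{i}\left(AC^{\mathrm H}B + C\right)\left(A^{\mathrm H}B\right)^{i}. \]

Therefore, the earlier remarks on the Smith($l$) and $r$-Smith iterations are also applicable to equation (73).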
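The Smith iteration (83) and the series in Remark 7 can be compared numerically. The sketch below is a minimal illustration with hypothetical random data, scaled so that condition (84) holds; the names `smith_hermitian` and `spectral_radius` are chosen here for illustration.

```python
import numpy as np
from numpy.linalg import matrix_power as mpow

def smith_hermitian(A, B, C, tol=1e-13, max_iter=10_000):
    """Iterate X_{k+1} = A X_k^H B + C starting from X_0 = 0."""
    X = np.zeros_like(C)
    for k in range(1, max_iter + 1):
        X_next = A @ X.conj().T @ B + C
        if np.linalg.norm(X_next - X) <= tol:
            return X_next, k
        X = X_next
    raise RuntimeError("Smith iteration did not converge")

def spectral_radius(M):
    return np.max(np.abs(np.linalg.eigvals(M)))

# Hypothetical data, scaled so that the spectral norms of A and B equal 0.8.
rng = np.random.default_rng(6)
m, n = 3, 4
A = rng.standard_normal((m, n)) + 1j * rng.standard_normal((m, n))
B = rng.standard_normal((m, n)) + 1j * rng.standard_normal((m, n))
C = rng.standard_normal((m, n)) + 1j * rng.standard_normal((m, n))
A *= 0.8 / np.linalg.norm(A, 2)
B *= 0.8 / np.linalg.norm(B, 2)

rho = spectral_radius(B.conj().T @ A)      # left-hand side of condition (84)
X, steps = smith_hermitian(A, B, C)

# Truncated series from Remark 7
F = A @ B.conj().T
G = A.conj().T @ B
D = A @ C.conj().T @ B + C
X_series = sum(mpow(F, i) @ D @ mpow(G, i) for i in range(200))

residual = np.linalg.norm(X - (A @ X.conj().T @ B + C))
print(rho, steps, residual)
```

The truncation length of 200 terms is an arbitrary choice that is more than sufficient for this scaling; slower-converging data would need more terms or an accelerated scheme.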
We finally use a simple numerical example to end this section.

Example 5 Consider a linear equation in the form of (73) with



A 



and 



1 

-2 
1 - i 



C = 



—l 
-1 



B = 



i 1 
i 
1 + i 3 



-5 + i -4-i -5-12i 
2 - i —4 — 2i 6 + 8i 
l + 3i 15 — 5i -4-5i 



-1 
2 + i 
— i 



It is readily verified that the eigenvalue set of $AB^{\mathrm H}$ satisfies the condition in (74). Then it follows from
Theorem 7 that the matrix equation in (73) has a unique solution, which can be obtained by using Theorem
9 as follows:

~ l + 3i -2 
X 1 2-i 1 

-2 2 2 + i 



7 A General Equation

In this section, we point out that the results obtained above can be easily extended to a general class of
matrix equations having more terms on the right hand side, say,

\[ X = \sum_{i=0}^{N}A_if(X)B_i + C, \tag{86} \]

where $N \geq 0$, $A_i \in \mathbf{C}^{m\times m}$, $B_i \in \mathbf{C}^{n\times n}$, $i \in \{0, 1, \ldots, N\}$, and $C \in \mathbf{C}^{m\times n}$ are given, and $X \in \mathbf{C}^{m\times n}$ is to
be determined. In fact, by using Proposition 1 and a technique similar to that used in the previous sections, we can extend
Theorem 4 to equation (86), as stated in the following theorem.

Theorem 11 Matrix equation (86) is solvable if and only if the following equation

\[ W = \sum_{i=0}^{N}A_if\left(\sum_{k=0}^{N}A_kf(W)B_k\right)B_i + \sum_{i=0}^{N}A_if(C)B_i + C, \tag{87} \]

is solvable with unknown $W$. More specifically:

1. If $X$ is a solution of equation (86), then $W = X$ is a solution of equation (87).

2. If $W$ is the general closed-form solution of equation (87), then the general closed-form solution of
equation (86) is given by

\[ X = \frac{1}{2}\left(W + \sum_{i=0}^{N}A_if(W)B_i + C\right). \]

3. If $f(X) = \overline{X}$ or $f(X) = X^{\mathrm H}$, then equation (86) has a unique solution $X^{\star}$ if and only if equation (87)
has a unique solution $W^{\star}$. If $f(X) = X^{\mathrm T}$, then equation (86) has a unique solution $X^{\star}$ if equation
(87) has a unique solution $W^{\star}$. In both cases, $X^{\star} = W^{\star}$.
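As a small illustration of Theorem 11, the sketch below treats the case $f(X) = \overline{X}$ with $N = 1$. For this choice of $f$ the composed map in (87) is complex-linear in $W$, so the auxiliary equation can be solved with Kronecker products. The data are random and the helper name `T` is hypothetical.

```python
import numpy as np

# Hypothetical random data for equation (86) with f = complex conjugation and N = 1.
rng = np.random.default_rng(7)
m, n, N = 3, 2, 1
A_list = [0.3 * (rng.standard_normal((m, m)) + 1j * rng.standard_normal((m, m)))
          for _ in range(N + 1)]
B_list = [0.3 * (rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n)))
          for _ in range(N + 1)]
C = rng.standard_normal((m, n)) + 1j * rng.standard_normal((m, n))

def T(X):
    """The map X -> sum_i A_i conj(X) B_i appearing on the right of (86)."""
    return sum(Ai @ X.conj() @ Bi for Ai, Bi in zip(A_list, B_list))

# T(T(W)) = sum_{i,k} (A_i conj(A_k)) W (conj(B_k) B_i), hence in vec form:
K = sum(np.kron((Bk.conj() @ Bi).T, Ai @ Ak.conj())
        for Ai, Bi in zip(A_list, B_list)
        for Ak, Bk in zip(A_list, B_list))

# Auxiliary equation (87): W = T(T(W)) + T(C) + C
rhs = T(C) + C
w = np.linalg.solve(np.eye(m * n) - K, rhs.flatten(order="F"))
W = w.reshape((m, n), order="F")

# Item 2 of Theorem 11: X = (W + T(W) + C) / 2
X = 0.5 * (W + T(W) + C)
residual = np.linalg.norm(X - (T(X) + C))
print(residual)
```

The same pattern should carry over to $f(X) = X^{\mathrm H}$ with the appropriate transposes, although that variant is not shown here.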



8 Conclusion

This paper is concerned with the solvability, existence of a unique solution, closed-form solution and numerical
solution of the matrix equation $X = Af(X)B + C$ with $f(X) = X^{\mathrm T}$, $f(X) = \overline{X}$ and $f(X) = X^{\mathrm H}$, where
$X$ is a matrix to be determined. It is established that the solvability of these equations is equivalent to
the solvability of some auxiliary standard Stein matrix equations in the form of $W = \mathcal{A}W\mathcal{B} + \mathcal{C}$, where the
dimensions of the coefficient matrices $\mathcal{A}$, $\mathcal{B}$ and $\mathcal{C}$ are the same as the dimensions of the coefficient matrices of
the original equation. Based on the auxiliary standard Stein matrix equation, the conditions for solvability
and the existence of a unique solution are proposed. Closed-form solutions are also obtained by inversion of a
square matrix, which is a generalization of the standard results on the standard Stein equation. Numerical
solutions are approximated by iterations which are generalizations of the Smith iteration and accelerated
Smith iteration associated with the standard Stein equations. We should point out that the idea in this paper
can be readily adopted to study equations in the form of $AX - f(X)B = C$ with $f(X) = X^{\mathrm T}$, $f(X) = \overline{X}$
and $f(X) = X^{\mathrm H}$.